Skip to content
View vishvaRam's full-sized avatar

Block or report vishvaRam

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
vishvaRam/README.md

πŸ’« About Me - Generative AI Engineer

A highly passionate and results-driven AI Engineer specializing in Generative AI & Agentic AI Systems.
With 1.5 years of hands-on experience across the full AI project lifecycle, I build, optimize, and deploy scalable AI-driven solutions.


πŸš€ Key Skills

  • Agentic AI & Orchestration: CrewAI, LangChain Agents
  • LLM Inference & Optimization: Llama.cpp Server, vLLM, Ollama
  • Generative AI Applications: Retrieval-Augmented Generation (RAG), Conversational AI, Autonomous Agents
  • Fine-tuning & Cloud Training: RunPod GPU Cloud, LoRA/QLoRA fine-tuning
  • Deployment & Frontend: Streamlit Apps, Docker, API-driven integrations
  • Cloud & Infra: AWS ECR, AWS ECS (Fargate/EC2), Task Definitions, Application Load Balancer, Auto Scaling
  • Programming Languages: Python (core), with focus on ML/AI frameworks

πŸ’Ό Experience

  • 1.5 years building and deploying LLM-powered solutions in production.
  • Designed & optimized RAG-based chatbots, knowledge assistants, and multi-agent workflows using CrewAI/LangChain.
  • Optimized inference paths with Llama.cpp/vLLM/Ollama to reduce latency and cost across multi-model deployments.
  • Built Streamlit dashboards and conversational UIs for rapid iteration and stakeholder demos.
  • Implemented AWS ECR + ECS deployments with robust Task Definitions, environment secrets, autoscaling, and ALB routing.
  • Deployed fine-tuned models on RunPod Cloud GPUs, leveraging LoRA/QLoRA strategies for cost-efficient training.

🌐 Socials

LinkedIn Instagram


πŸ’» Tech Stack

Python
LangChain
CrewAI
Llama.cpp
vLLM
Ollama
Streamlit
Docker
AWS ECR
AWS ECS
RunPod
PyTorch
TensorFlow


πŸ“Š GitHub Stats




Pinned Loading

  1. Structured-Output-Examples-for-LLMs Structured-Output-Examples-for-LLMs Public

    This repository demonstrates structured data extraction using various language models and frameworks. It includes examples of generating JSON outputs for name and age extraction from text prompts. …

    Python 1

  2. Data-Prep-for-LLM-fine-tuning Data-Prep-for-LLM-fine-tuning Public

    This repository helps prepare datasets for fine-tuning Large Language Models (LLMs). It includes tools for cleaning, formatting, and augmenting data to improve model performance. Designed for resea…

    Jupyter Notebook 1

  3. Blog-Writing-Agentic-RAG-CrewAI Blog-Writing-Agentic-RAG-CrewAI Public

    An automated blog writing system that leverages CrewAI to create high-quality, well-researched blog posts. The project implements a multi-agent workflow for researching topics, generating content, …

    Python

  4. Fine-Tuning-LLMs Fine-Tuning-LLMs Public

    Jupyter Notebook