Skip to content
View Sumitkumar005's full-sized avatar
🎯
Focusing
🎯
Focusing

Block or report Sumitkumar005

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don't include any personal information such as legal names or email addresses. Markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Sumitkumar005/README.md

πŸ‘‹ Hi, I'm Sumit Kumar

πŸš€ AI Engineer | ML Research Engineer | Full Stack Developer

Portfolio LinkedIn Email GitHub

Profile Views


🎯 About Me

class AIEngineer:
    def __init__(self):
        self.name = "Sumit Kumar"
        self.role = "AI/ML Engineer & Full Stack Developer"
        self.education = "IIT Madras - BS in Data Science & Programming"
        self.location = "Bengaluru, India"
        self.expertise = [
            "Generative AI & LLMs",
            "Computer Vision & 3D Reconstruction", 
            "Backend Architecture & APIs",
            "MLOps & Cloud Deployment"
        ]
        
    def current_focus(self):
        return {
            "πŸ”¬ Research": "Multimodal AI & Time Series Forecasting",
            "πŸ—οΈ Building": "AI-Powered Production Systems",
            "πŸ“š Learning": "Advanced Agent Architectures & Graph Neural Networks",
            "🌟 Goal": "Transforming AI Research into Scalable Solutions"
        }

πŸ”Ή AI Engineer passionate about building impactful solutions in Computer Vision, NLP, and Generative AI
πŸ”Ή Skilled in designing end-to-end ML systems: RAG chatbots, 3D vision models, multimodal AI
πŸ”Ή Experienced in backend API development, microservices, and cloud-based MLOps
πŸ”Ή Thriving on turning complex AI research into production-ready solutions that drive real-world impact


πŸ“Š GitHub Statistics

GitHub Stats

GitHub Streak

Top Languages


πŸ’Ό Professional Experience

🏒 Current Roles

πŸ”Ή AI Engineer Intern
πŸ“ ForeignAdmits (VisaMonk AI) | Bengaluru
πŸ“… July 2025 – Present

  • Built FA-Admission Backend with Node.js, Express.js, MongoDB
  • Developed AI-powered University Chatbot with RAG pipeline & FAISS
  • Created AI Document Processing System using Tesseract OCR & GPT-4
  • Engineered Email Outreach Platform with automated AI content generation

πŸ”Ή AI/ML Research Engineer
πŸ“ Freelancer | Remote (South Korea)
πŸ“… Oct 2025 – Present

  • Multimodal Emotion Recognition: 92.53% accuracy on IEMOCAP
  • Fashion Trend Forecasting with ensemble models (N-BEATS, PatchTST)
  • Research on Graph Neural Networks & adaptive modality weighting
  • MLOps pipelines with uncertainty quantification

🎯 Recent Positions

πŸ”Ή Backend Developer (Freelancer) | ElitCeler Technologies | Aug 2025 - Oct 2025

  • Architected RESTful APIs for 2 full-scale e-commerce platforms
  • Built Bazar Story & Printrove WMS backends with 50+ endpoints
  • Integrated AWS S3, Shopify OAuth, payment gateways

πŸ”Ή AI & Data Science Intern | HTS Tech Solutions | Mar 2025 - Jul 2025 | PPO Received βœ…

  • YOLOv11-based rust detection for cell towers: 85% accuracy
  • 3D Model reconstruction using OpenMVG/OpenMVS
  • Reduced model build time from 12 hours β†’ 3-4 hours
  • Report delivery time: 3 days β†’ <24 hours

πŸ”Ή Product and AI (Freelancer) | Arfve | Stockholm, Sweden | Apr 2025 - Jul 2025

  • AI agent-driven lead generation & automation
  • UX improvements & prototype features for accelerator cohort

πŸ”Ή Full-Stack Developer | Devvoy | Jan 2025 - May 2025

  • AI-powered therapy platform with LLM-driven dialogues
  • Voice-enabled interactions using React, FastAPI, ElevenLabs TTS
  • Mentored 3+ contributors on Git workflows & deployment

πŸ› οΈ Tech Stack

πŸ’» Languages

Python C++ JavaScript TypeScript Java

🌐 Backend & APIs

FastAPI Node.js Express.js Django Flask GraphQL

πŸ€– AI/ML & LLMs

PyTorch TensorFlow LangChain Hugging Face OpenAI OpenCV

πŸ›’οΈ Databases

PostgreSQL MongoDB Redis MySQL Supabase FAISS Pinecone

☁️ Cloud & DevOps

AWS GCP Azure Docker Kubernetes GitHub Actions

🎨 Frontend

React Next.js Tailwind CSS


πŸš€ Featured Projects

Automated trucking dispatch system with AI-powered voice calls

Tech: FastAPI, PostgreSQL, Vapi.ai, Twilio
Impact: 90% reduction in manual dispatch operations

Features:

  • AI-driven voice conversations
  • Real-time webhook processing
  • International call support
  • Driver management APIs

Comprehensive code quality assessment across 10+ languages

Tech: FastAPI, Google Gemini AI, FAISS, MongoDB
Features:

  • RAG engine for codebase Q&A
  • AST parsing for security vulnerabilities
  • GitHub integration
  • Real-time progress tracking

Scalable AI-powered voice calling platform

Tech: Node.js, Twilio, Supabase, Groq, Deepgram
Features:

  • RESTful APIs with RBAC
  • Job queues for campaign management
  • Speech-to-text transcription
  • WebRTC integration

Full-stack chatbot with vector search and real-time processing

Tech: Python, FAISS, Redis, Flask
Highlights:

  • 90% accuracy with hybrid RAG
  • Web scraping & data indexing
  • Multi-tenant deployment
  • TTS generation

CNN-based defect detection for manufacturing

Tech: Keras, Flask, OpenCV, Node.js
Results:

  • 93% detection accuracy
  • 18x faster inspection time
  • 7,000+ training images

YOLOv11 + 3D reconstruction pipeline

Tech: YOLOv11, OpenMVG/OpenMVS, Node.js
Achievements:

  • 85% detection accuracy
  • 12 hrs β†’ 3-4 hrs model build time
  • 3 days β†’ <24 hrs report delivery

πŸ“‚ View More Projects | πŸ“Š Data Science Projects


πŸ† Achievements & Certifications

πŸ₯‡ Top 3 in Industrial AI Solutions Hackathon (2024)
πŸ“° Published Machine Learning research in IIT Madras Newsletter (Nov 2024)
πŸ‘₯ Led 200+ students programming community with Codeforces/LeetCode challenges
πŸŽ“ BS in Data Science & Programming from IIT Madras (2024-2027)
πŸ’Ό PPO Received from HTS Tech Solutions (2025)


πŸ“ˆ Contribution Graph

Activity Graph


🎯 Core Competencies

Domain Skills
πŸ€– Generative AI RAG Architecture, Multi-Agent Systems, Prompt Engineering, Function Calling, LangChain, LlamaIndex
🧠 Machine Learning CNNs, Transformers, Graph Neural Networks, YOLOv11, LoRA/QLoRA Fine-tuning, Multimodal AI
πŸ—οΈ Backend Engineering REST APIs, GraphQL, Microservices, JWT/OAuth, RBAC, API Gateway, Rate Limiting
☁️ MLOps & Cloud Model Deployment, Drift Detection, AWS SageMaker, Docker/Kubernetes, CI/CD Pipelines
πŸ“Š Data Engineering ETL Pipelines, Apache Spark/Kafka, Vector Databases, Big Data Processing
πŸ”§ System Design Scalable Architecture, Load Balancing, Caching Strategies, Database Optimization

πŸ’‘ What I'm Currently Working On

πŸ”¬ Research:
  - Multimodal Emotion Recognition with Graph Neural Networks
  - Fashion Trend Forecasting using Ensemble Time Series Models
  - Adaptive Modality Weighting for Robust AI Systems

πŸ—οΈ Building:
  - AI-Powered Voice Communication Systems
  - Document Processing Pipelines with OCR & LLM Integration
  - Scalable RAG Architectures for Enterprise Applications

πŸ“š Learning:
  - Advanced Agent Architectures (ReAct, Reflexion)
  - Real-time Streaming AI Applications
  - Production MLOps Best Practices

πŸ“« Let's Connect!

LinkedIn Email Portfolio GitHub

πŸ’¬ Open to collaborations on AI/ML projects, research opportunities, and interesting tech challenges!


🌟 "Transforming AI Research into Production-Ready Solutions" 🌟

Typing SVG

⭐ If you find my work interesting, feel free to star my repositories! ⭐

Pinned Loading

  1. QA_TESTER QA_TESTER Public

    Python

  2. custom-outreach-application custom-outreach-application Public

    TypeScript

  3. Voice-AI-Hemut-Frontend Voice-AI-Hemut-Frontend Public

    JavaScript

  4. VoxFlow.ai VoxFlow.ai Public

    JavaScript