🚀 Senior Full Stack & Mobile Engineer · AI/LLM Integration · System Architecture · Technical Leadership
Senior Full Stack & Mobile Engineer with 9+ years of experience designing, building, and scaling production systems used by over 1 million users. Expertise across NestJS backend APIs, cloud infrastructure, Expo/React Native mobile apps, and Next.js web frontends, built with TypeScript and Prisma ORM.
Focused on AI-native product engineering: shipping OpenAI and Anthropic Claude integrations, production RAG pipelines, and LLM-powered features that cut inference costs by 40% and P95 latency by 55%. Proven impact across backend, mobile, web, and AI/LLM integration: improved team delivery velocity by 18% and API throughput by 20%, and reduced infrastructure costs by 15%.
🚀 Led 3-engineer teams · Shipped apps to App Store & Play Store · Built systems serving 1M+ users · Reduced infrastructure costs by 15% · Achieved 99.9% uptime · Cut LLM inference costs by 40%
| 🏗️ System Architecture & Design | ⚙️ API Design & Backend Engineering | ☁️ Cloud Infrastructure (AWS) |
|---|---|---|
| 📱 React Native & Expo Mobile Eng. | 🍎 Cross-Platform iOS & Android Delivery | 🚀 App Store & Play Store Deployment |
| 🤖 LLM Integration & RAG Pipelines | 🧪 Prompt Engineering & Agent Workflows | 🗄️ Vector Databases & Embeddings |
| 🌐 Full Stack SaaS Development | ⚡ Performance Engineering | 👥 Technical Leadership & Mentoring |
| 🗄️ Database Design & Optimization | 🔁 CI/CD, Docker & DevOps | 🔐 Security & Auth Implementation |
Backend
AI / LLM
Mobile
Frontend
Databases & Caching
Cloud & DevOps
🏢 Lead Full Stack Engineer — AI Avatar Platform · Mascotte.AI, Remote (Europe) · Nov 2024 – Present
- 🏗️ Architected and led end-to-end delivery of a B2C AI avatar SaaS (Mascotte.AI v1) on Next.js 14, TypeScript, MySQL/AWS RDS — shipping 67 pages, 50+ API routes, real-time 3D avatars, voice chat, subscription billing, and a full admin panel across 12 languages
- 🔀 Designed and led architectural migration to a decoupled B2B developer platform (v2) — splitting the Next.js monolith into React 19 + Vite 7 frontend and a standalone Express.js + Prisma 6 API, unlocking enterprise embed, SDK, and white-label use cases
- 🎙️ Built the real-time voice AI pipeline — LiveKit WebRTC + Deepgram STT + OpenAI GPT (via OpenRouter multi-model proxy) + ElevenLabs/Google Cloud TTS — delivering sub-second conversational avatars with per-avatar system prompts and RAG knowledge bases
- 🎭 Integrated Arcware Pixel Streaming, Three.js/React Three Fiber with VRM avatars, Canvas 2D editor, head tracking with calibration, and Rhubarb WASM lip-sync — combining photorealistic and stylized avatars in a unified UX
- 🔑 Built the developer platform from scratch — encrypted API key management, webhook system, SDK session tracking, request logging, and embeddable widget with usage analytics — opening a new B2B revenue channel
- 💳 Shipped Stripe + PayPal subscriptions with credit-based usage system, multi-tier plans, coupon/ambassador programs, OAuth (Google/Apple/Facebook/Discord) via NextAuth v4, AWS S3 media storage, and Pusher/Socket.IO real-time events
🏢 Senior Full Stack & Mobile Engineer — Architecture & Platform Lead · VulnCheck, USA (Remote) · Jan 2024 – Sep 2024
- 🏗️ Architected a fintech SaaS platform end-to-end (NestJS, PostgreSQL/Prisma, Next.js, AWS) — scaling to 1M+ active users, reducing system-wide latency by 35%, and improving API throughput by 20%
- 🤖 Shipped the core AI layer integrating OpenAI GPT-4o and Anthropic Claude APIs with streaming, function calling, and structured outputs — powering conversational financial copilots used daily by thousands of users
- 🗄️ Built a production RAG pipeline (pgvector, hybrid semantic/keyword search, Redis caching) tuned for financial documents — cutting LLM inference costs by 40% and P95 answer latency by 55%
- 🔗 Designed multi-step AI agent workflows with tool calling, retries, guardrails, and fallback strategies — deflecting 30% of support tickets without human involvement
- 📱 Shipped a cross-platform Expo + React Native mobile app (iOS & Android) with offline-first sync, biometric auth, push notifications, and an embedded AI assistant — growing mobile to 40% of total platform traffic within 6 months
- ☁️ Architected AWS infrastructure (EC2 auto-scaling, ALB, CloudFront, S3, Bedrock fallback) and established team engineering practices — reducing hosting costs by 15% and lifting team velocity by 18%
🏢 Principal Engineer & Engineering Manager · MERCURY DASHA, Remote · Sep 2022 – Dec 2023
- 👥 Led and mentored a 3-engineer cross-functional team to 100% on-time sprint delivery with Jest + React Testing Library coverage gates above 85%
- 🤖 Integrated OpenAI GPT-4 and text-embedding-ada-002 into search, recommendation, and content generation flows — lifting content discovery engagement by 25% and unlocking a premium AI tier that drove 18% revenue lift
- 📱 Architected and shipped an Expo + React Native mobile app serving 1M+ users — becoming a core channel contributing 35% of total revenue
- ⚡ Designed distributed Redis caching across expensive API endpoints, LLM responses, and PostgreSQL queries — absorbing 3x traffic growth with no additional infrastructure spend
- 🏗️ Built the core NestJS + TypeScript + MongoDB API layer and multi-environment AWS + Heroku infrastructure with blue-green deploys — reducing deploy failures by 90% and eliminating environment drift incidents
🏢 Full Stack & Mobile Engineer — Platform & Systems · PXN Phantom Network, Remote · Mar 2019 – Jul 2022
- 📦 Delivered 30+ production features on a high-traffic global platform (Next.js, Django, PostgreSQL, AWS) with full Jest + React Testing Library coverage and zero post-release regressions
- 🤖 Prototyped and shipped the company's first LLM-powered features using early OpenAI APIs (text-davinci, GPT-3.5) — content summarization, semantic tagging, and a support Q&A assistant that cut response time by 45%
- 📱 Designed and launched an Expo + React Native app for iOS and Android end-to-end — reaching 250k+ downloads and opening a new mobile-first user segment
- 🔄 Led full-stack migration from a legacy PHP monolith to Next.js + TypeScript + Django REST + Prisma — improving maintainability by 40%, halving onboarding time, and enabling independent frontend/backend deployments
- 📊 Implemented Next.js SSR/SSG strategies that reduced TTFB by 60% and lifted conversion and revenue by 10%; built a Three.js/WebGL 3D marketing experience reaching 2M+ visitors across 12 markets
🏢 Full Stack Developer — Backend & Infrastructure · Node Audio Ltd, Remote · Jun 2018 – Dec 2018
- 🔍 Resolved critical MongoDB performance regressions — profiling queries, redesigning indexes, and rewriting aggregation pipelines — reducing P95 execution time by 35% and supporting 2x concurrent user growth
- ✅ Architected AWS CI/CD pipeline (CodePipeline, EC2, Docker, health checks, automated rollback) — sustaining 99.9% uptime and cutting deployment time from 45 to 8 minutes
- 🔐 Implemented stateless JWT auth with refresh token rotation and RBAC — securing 50k+ active accounts and lifting user engagement by 28%
🏢 Full Stack Developer · Fleamint, Remote · Feb 2015 – Apr 2017
- 💳 Architected a payment processing system with third-party gateway integration, idempotent retries, and webhook handling — reducing transaction failure rates by 18% and processing $2M+ monthly
- 🛒 Built scalable marketplace modules (catalog, cart, order management, seller dashboards) — increasing checkout speed by 22% and supporting growth from hundreds to thousands of daily transactions
- 🔧 Partnered with product leadership on technical feasibility, MVP scope, and phased roadmap — ensuring the team built the right things in the right order
| 👥 1M+ Users Served | 🤖 40% LLM Cost Reduction | 📱 3 Mobile Apps Shipped |
|---|---|---|
| ☁️ 15% Infrastructure Cost Reduction | 🗄️ 35% Query Time Reduction | ✅ 99.9% Uptime |
| 👨‍💻 7 Engineers Led | 🎯 100% On-Time Delivery | 📦 30+ Production Features |
| ⚡ 55% P95 Latency Improvement | 🔗 30% Support Ticket Deflection | 📈 250k+ Mobile Downloads |
✦ Full-stack depth across backend, mobile, web, and AI — end-to-end ownership
✦ Production LLM engineering — OpenAI GPT-4o & Anthropic Claude, RAG pipelines, agent workflows
✦ Mobile-native: shipped Expo + React Native apps to App Store & Play Store (not just tutorials)
✦ Proven at scale: solved real concurrency, caching & infrastructure problems at 1M+ users
✦ Engineering leader: manage teams while staying deeply technical
✦ Remote-native: 5+ years async across US, Canada & European timezones
🎓 Bachelor of Science, Computer Science — Singapore Institute of Technology




