Generative AI Engineer | Full Stack Developer | Open Source Enthusiast
I'm a Generative AI Engineer with hands-on expertise in LLMs, Retrieval-Augmented Generation (RAG), and production deployment of AI-powered applications. I am currently pursuing a B.Tech in Computer Science & Engineering at Bhilai Institute of Technology, Durg (2022-2025).
I specialize in building scalable AI systems using modern frameworks and cloud infrastructure, with a strong focus on:
- Large Language Models (LLMs) and RAG pipelines
- Speech-to-Text systems with OpenAI Whisper
- High-performance microservices architecture
- Blockchain and Web3 technologies
Aug 2024 - Oct 2025
- Built enterprise-grade Agent Assist system improving customer service efficiency by 40%
- Architected and deployed ML models including streaming audio transcription with OpenAI Whisper
- Developed scalable microservices with Asterisk PBX, Keycloak, Azure, RunPod, and Google Cloud
- Created RAG pipelines with LangChain and developed frontend with React and Next.js
Apr 2024 - Jul 2024
- Built RESTful APIs and RAG systems using NLP, Docker, AWS
- Designed Multi-RAG inference reducing response time by 35%
- Deployed Faster Whisper ASR server with 50% speed improvement
Jun 2023 - Jul 2023
- Optimized cryptographic algorithms, improving performance by 20%
- Enhanced AES security and co-authored research on block cipher algorithms
- Developed a robust algorithm for a secure block cipher (research paper in progress)
- Frameworks: LangChain, TensorFlow, PyTorch
- RAG Systems: ChromaDB, LanceDB, MongoDB
- Models: OpenAI Whisper, LLMs
- Techniques: Prompt Engineering, NLP, Computer Vision
- Frontend: React, Next.js, HTML, CSS, JavaScript
- Backend: FastAPI, Django, Node.js
- Architecture: REST API, WebSocket, Microservices
- Cloud Platforms: AWS, Azure, Google Cloud, RunPod
- Containers: Docker, Kubernetes
- Tools: Git, GitHub, Linux
- Blockchain: Solidity, Smart Contracts, Web3
FlowKit
Distributed, graph-based AI workflow orchestration system with isolated execution, secure secret management, and dynamic auto-scaling for high-availability pipeline processing.
Tech Stack: Python, Docker, Kubernetes, Microservices
WhisperS2TServer
High-performance, multi-GPU speech-to-text system using OpenAI Whisper with a CTranslate2 backend. Features dynamic auto-scaling for secure, resource-efficient audio transcription.
Tech Stack: OpenAI Whisper, CTranslate2, Python, FastAPI, Docker
LlamaIndex-MultiRAG
Multi-RAG server that integrates LanceDB and MongoDB, using different embeddings to improve retrieval performance.
Tech Stack: LlamaIndex, LangChain, LanceDB, MongoDB, Python
- Machine Learning - AICTE (2023)
- Google Data Analytics - Coursera (2022)
- Python - HackerRank (2021)
- Playing Volleyball
- Open-source contributions
- AI Research
- Blockchain Technology
I'm always open to interesting conversations and collaboration opportunities. Feel free to reach out!
- Email: [email protected]
- LinkedIn: linkedin.com/in/anshjoseph
- GitHub: github.com/anshjoseph
- Phone: +91 9981634633
From anshjoseph | Open to collaborations and opportunities!

