Anish Bhat

AI/ML Engineer & Developer

ANISH BHAT

Building intelligent systems with Generative AI, Agents, Automation, and Product Development.

About Me

AI/ML Engineer and backend developer with hands-on experience designing scalable backend systems, API-driven architectures, and production-grade ML pipelines.

Proficient in Python, OOP, and clean code practices with a strong emphasis on writing test cases, debugging production issues, and maintaining reliable version control workflows.

Proven track record of shipping end-to-end systems—from architecture design through testing and deployment—with measurable impact on performance, reliability, and business outcomes.

10+Projects Built
3Research Publications
3+Years building project experience
2Hackathons Won

Academic Background

Bachelor of Engineering in Computer Science (Artificial Intelligence)

KLE Technological University, Hubballi, Karnataka

2022–2026GPA: 7.63/10

Pre-University (PCMB)

Expert PU College, Mangaluru, Karnataka

2020–2022Score: 93.16%

Honors & Certs

Certifications

Getting Started with AI on Jetson Nano2026
Neo4j Certified Professional: Graph Data Science2026
Building Knowledge Graphs with LLMs2026

Awards

🏆

1st Place, College Mechatronics Competition (130+ teams)

🏆

Hackathon Finalist & Team Lead, Smart India Hackathon and HackKarnataka

Technical Arsenal

AI & Machine Learning

Deep Learning
Deep Learning70%
Computer Vision
Computer Vision68%
NLP
NLP65%
Generative AI
Generative AI67%
Transformers
Transformers65%
RAG
RAG69%
Agentic Systems
Agentic Systems65%

Frameworks & Libraries

PyTorch
PyTorch70%
TensorFlow
TensorFlow68%
HuggingFace
HuggingFace66%
OpenCV
OpenCV70%
LangChain
LangChain65%
n8n
n8n67%
Pandas
Pandas69%
NumPy
NumPy70%

Backend & Web

Python
Python70%
C++
C++62%
HTML
HTML70%
Next.js
Next.js65%
FastAPI
FastAPI68%
Flask
Flask66%
React
React65%

Databases & Cloud

PostgreSQL
PostgreSQL68%
MongoDB
MongoDB65%
Pinecone
Pinecone65%
Firebase
Firebase70%
Docker
Docker68%
GCP
GCP64%
Vercel
Vercel68%

Professional Experience

Freelance AI Engineer (Lead Developer)

Freelance Hubballi, India
Apr 2025 – Apr 2026
  • Led development of a real-time vision-based assistive system using object detection and scene understanding for visually impaired users.
  • Designed optimized computer vision inference pipelines using OpenCV and deep learning models, achieving 2.5× faster inference on Raspberry Pi through INT8 quantization.
  • Integrated camera-based perception with Firebase APIs and Android applications for real-world deployment.
  • Containerized complete vision pipelines using Docker and implemented CI/CD workflows, reducing deployment time by 35%.
  • Led and coordinated a 3-member engineering team, ensuring on-time delivery of embedded AI milestones.
OpenCVDeep LearningFirebaseDockerCI/CDRaspberry PiINT8

Embedded AI Intern

Decibels Lab Bengaluru, India
Feb 2025 – Apr 2025
  • Trained and evaluated YOLOv11 object detection models on KITTI dataset using statistical metrics and validation techniques, achieving over 65% model size reduction through structured pruning and INT8 quantization.
  • Performed data preprocessing and feature engineering on autonomous driving datasets to improve detection accuracy across Raspberry Pi and Jetson Nano edge devices.
  • Built a scalable backend ML pipeline integrating real-time object detection, lane detection, and sensor fusion with end-to-end latency of approximately 210ms; debugged latency bottlenecks and sensor sync failures to meet production performance targets.
  • Deployed optimized deep learning models on embedded hardware, implementing REST APIs for telemetry and system monitoring; managed codebase on GitHub with clean branching and documented commit history.
YOLOv11INT8 QuantizationRaspberry PiJetson NanoREST APIGitHub
01

Svarra

Multi-Agent AI Voice Automation Platform handling autonomous inbound/outbound voice calls at scale.

Problem Solved

Automating complex conversational workflows like lead qualification, appointment booking, and follow-ups securely.

Key Outcomes

  • Exposed modular REST APIs for 4 specialized agents.
  • Implemented idempotent webhook handling for CRM updates and calendar bookings.
  • Delivered a real-time analytics dashboard tracking call duration, cost, and conversion rates.
  • Enabled multi-lingual voice interactions for the Indian market.
PythonNext.jsFastAPIPostgreSQLSupabaseVapiTwilion8nGitHubVercel
Svarra
4
Agents
Python
Stack
02

Offline P2P Blockchain Wallet

A decentralized offline peer-to-peer blockchain payment system that enables secure financial transactions without internet connectivity, bridging the gap for rural and underserved communities.

Problem Solved

Financial inclusion in India remains a challenge where internet connectivity is unreliable. Traditional digital payments exclude millions. This solution prevents double-spending offline and ensures trustless transactions without continuous internet access.

Key Outcomes

  • Built a secure, truly offline-first decentralized payment system
  • Eliminated reliance on continuous internet connectivity for rural users
  • Solved offline double-spending via local nonce tracking and on-chain validation
ReactFirebaseethers.jsSQLiteBlockchainCryptography
Offline P2P Blockchain Wallet
Rural/Offline
Target Audience
Cryptographic
Security
03

Embedded ADAS Prototype

Designed an end-to-end embedded Advanced Driver Assistance System integrating realtime object detection, lane detection, ultrasonic sensing, and motor control.

Problem Solved

Building an affordable and deployable ADAS prototype on low-power edge hardware.

Key Outcomes

  • Achieved ~6 FPS real-time object detection inference on a Raspberry Pi.
  • Integrated ultrasonic sensors and motor control for real-world interaction.
  • Developed a Flask-based web interface for real-time visualization, telemetry, and experimental evaluation.
Raspberry PiYOLOv12nINT8 QuantizationFlask
Embedded ADAS Prototype
6 FPS
Inference
04

YOLO-OptiMob

Built a complete optimization pipeline for deploying YOLO11 object detection models on resource-constrained edge devices.

Problem Solved

Massive memory footprints and slow inference times of default object detection models on mobile devices.

Key Outcomes

  • Selected 40% L1 unstructured pruning as optimal trade-off.
  • Reduced model size from 11.4 MB to 2.5 MB.
  • Successfully converted to TFLite for deployment on Android-based edge platforms.
YOLO11TFLiteL1 PruningINT8 Quantization
YOLO-OptiMob
78%
Size Reduction
2.5MB
Final Size
05

Vision-Language Scene Understanding (BLIP)

Developed a real-time vision-language system using BLIP to generate natural language scene descriptions from live visual input.

Problem Solved

Enabling semantic, natural language reasoning over complex visual scenes rather than just bounding boxes.

Key Outcomes

  • Generated structured textual representations of scenes to enable downstream reasoning by large language models.
  • Achieved 82% accuracy through dataset curation, fine-tuning, and inference optimization.
BLIPPyTorchVision-Language Models
Vision-Language Scene Understanding (BLIP)
82%
Accuracy
06

Agentic RAG Medical Data Extractor

Designed a medical document RAG pipeline using Gemini embeddings, Pinecone, Firebase, and GCP APIs.

Problem Solved

Extracting highly specific, structured summaries from massive, unstructured medical PDFs securely.

Key Outcomes

  • Extracted structured summaries from unstructured medical PDFs for research use.
  • Automated the workflow reliably via Docker and n8n.
Gemini EmbeddingsPineconeFirebaseGCP APIDockern8n
Agentic RAG Medical Data Extractor
Medical
Domain
07

Elderly Care Full-Stack Platform

Developed a full-stack platform enabling ambulance booking, medicine ordering, and elderly care services.

Problem Solved

Providing an accessible, all-in-one care hub for elderly patients to manage their critical services.

Key Outcomes

  • Delivered a robust, real-time application for medicine and ambulance orchestration.
  • Secured platform with Firebase Auth.
MERNFirebaseMongoDBReactNode.js
Elderly Care Full-Stack Platform
MERN
Stack

Research Publications

ICCIS2025

LongDocAI: A Quantized Modular Pipeline for Multimodal PDF Summarization and QA

The rapid expansion of scientific literature has made it increasingly difficult for researchers to extract meaningful insights from dense academic documents. We introduce LongDocAI, a multimodal, multi-model framework designed to efficiently understand and interact with scholarly PDFs at scale. The system combines Donut, an OCR-free document parser, with a hybrid summarization module based on BARTlarge- CNN, trained respectively on over 200,000 paper-summary pairs and 30,000+ question-answer examples. To ensure faster and resourceefficient inference, we apply LLM.int8 post-training quantization across all transformer models, significantly reducing memory usage and latency without compromising output quality. LongDocAI is built as a unified pipeline that handles both layout-aware parsing and semantic-level understanding through summarization and interactive question answering. In our evaluations, the system achieved a 15% improvement in ROUGEL, a 28% boost in METEOR, and a QA accuracy of 85.3%, based on expert human assessments. Quantization further led to a 70% reduction in model size and a 40% decrease in inference time, making the system suitable for real-time or edge deployment. By integrating multimodal document understanding, summarization, question answering, and quantization into a single, modular pipeline, LongDocAI offers a scalable, lightweight solution for navigating and understanding long-form academic texts—benefiting researchers, reviewers, and knowledge platforms alike.

Read Abstract
CIS2025

Embedded Intelligent ADAS Car Prototype Using Raspberry Pi and YOLOv12n

This paper presents the design, quantization, and deployment of an embedded Advanced Driver Assistance System (ADAS) prototype on a Raspberry Pi 3, utilizing a custom-trained YOLOv12n object detection model. The system achieves real-time multi-class object detection and autonomous control of a test vehicle within the resource constraints of low-power edge devices. To optimize performance, posttraining INT8 quantization was applied to YOLOv12n, resulting in over 65% reduction in model size and nearly 3× faster inference, with minimal accuracy loss. Object detection outputs are combined with ultrasonic sensor measurements to enable obstacle-aware vehicle control, including braking and navigation. Lightweight classical computer vision methods facilitate lane detection for lane-keeping functionality. Additionally, a Flask-based dashboard streams detection overlays and telemetry data for monitoring. The deployed system operates at approximately 6 frames per second on the Raspberry Pi 3 with a responsive control latency of around 210 ms, demonstrating the practical feasibility of deploying deep learning–based ADAS functions on low-cost, resource-constrained embedded platforms. This work highlights the effectiveness of quantization techniques in enabling real-time perception for embedded autonomous driving applications.

Read Abstract
SPIN2025

YOLO-OptiMob: A Pipeline for Optimizing YOLO11 Models for Edge Deployment

This paper introduces YOLO-OptiMob , a comprehensive pipeline for optimizing YOLO11 models for deployment on edge devices. The process begins with creating and preprocessing a custom dataset featuring seven object classes: bike, car, cat, dog, person, handbag, and water bottle. The YOLO11 model is trained on this dataset and optimized using L1 unstructured pruning, with rates of 30%, 40%, and 50% evaluated. Based on the results, a 40% pruning rate was selected as it offered the best balance between model size reduction and accuracy retention. Post-training INT8 quantization further compresses the model, reducing its size from 11.4 MB to 2.5 MB. The optimized model is then converted into TensorFlow Lite (TFLite) format, ensuring compatibility with Android-based edge devices. This work presents a practical pipeline for efficiently adapting YOLO11 to resource-constrained environments, achieving significant size reduction while maintaining high detection accuracy.

Read Abstract

Let's Build

Whether it's a freelance AI pipeline, a scalable backend, or a full-stack product, I'm ready to collaborate.