Debjyoti Ray

I'm a ML Engineer at Miiflow Inc., where I work on machine learning, and applied AI engineering.

My research interests include multimodal AI, diffusion models, reinforcement learning, interpretability of language models, and multi-agent systems. I've worked on scalable OCR pipelines, biomedical knowledge graphs, and ensemble trading agents. I'm pursuing a B.Tech in Electronics and Communication Engineering at IIIT Allahabad and a B.S. in Data Science at IIT Madras. I am currently really interested in image-vision diffusion models, agentic systems and scalable architectures.

Email / LinkedIn / GitHub / CV

Education

	Bachelors of Technology(B. Tech) in Electronics and Communication Engineering Indian Institute of Information Technology, Allahabad Expected October 2026 GPA: 8.4/10
	Bachelors of Science(B.S.) in Data Science Indian Institute of Technology, Madras Expected June 2026 GPA: 8.13/10

Experience

My professional experience spans AI engineering, machine learning research, and quantitative finance.

	ML Engineer Miiflow Inc. \| Remote August 2025 - Present • Engineered a unified LLM orchestration layer for Miiflow replacing LlamaIndex, supporting 7+ providers with <200 ms latency overhead, streaming, ReAct reasoning, and 95% test coverage. • Enhanced Miiflow's agentic memory framework by implementing tool-calling safety, OpenTelemetry tracing, and CI/CD pipelines—boosting developer efficiency and reducing dependency overhead by ~40%.
	Software Dev Engineer Intern Amazon Web Services (AWS Q for Business) \| Bangalore, India May 2025 - July 2025 • Integrated real-time voice-driven bidirectional streaming APIs into AWS Q for Business using Coral (Amazon's Java/Spring framework), enabling seamless voice-to-LLM-voice interactions via Bedrock-hosted models and EC2-deployed services. • Architected and deployed a scalable microservice-based voice interface as an independent service layer within AWS Q, using Amazon Bedrock, SageMaker, and EC2, and designed with PACELC to ensure persistent sessions with low-latency.
	AI Engineer Intern TransHumanity \| Remote(London, United Kingdom) April 2025 - May 2025 • Packaged an independent Python‑based text→SQL CLI tool (text_to_sql.py) that parses 4+ parameters (vehicle type, sensor, date range, aggregation) from natural language and emits valid PostgreSQL queries with 98% syntactic accuracy, cutting manual query writing time by 70%. • Engineered multi‑layered SQL validation (basic & schema‑aware) to block dangerous operations and mis‑typed tables/columns, reducing invalid‑query failures by 85%. • Developed an intelligent correction system—rule‑based fixes for common typos, LLM‑based fallback for complex errors, plus a self‑learning loop—that auto‑corrects 92% of syntax/name mistakes and preempts 60% of recurring error patterns.
	AI Engineer ALMA Inc. \| Remote (San Francisco Bay Area) Feb 2025 - Apr 2025 • Resolved 40+ support tickets via Notion, Linear, and Slack; on user onboarding by deploying seamless OCR and parameter extraction pipelines from submitted legal documents, improving user intake efficiency by 23%. • Engineered a multi-agent document extraction pipeline, increasing OCR accuracy by 45% and reducing data-entry errors by 29%, using: Mistral-ocr-latest for markdown conversion; Pixtral-12B; Mistral-2B SLM as a post-processing agent.
	ML Research Intern Crecientech in association with University of Missouri-Columbia (Profs. Dong Xu & Gyan Srivastava) \| Remote (Boston, Massachusetts) Dec 2024 - Feb 2025 • Built a biomedical knowledge graph pipeline processing 500+ PubMed abstracts with MedPalm-2, achieving 92% accuracy in protein-pathway mapping cross-referenced with STRING-db, and Reactome. • Scaled automated graph construction by 22.71%, creating 3000+ entities and relationships, used multi-agent architecture for, leveraging small language models (SLMs) to reduce inference time and latency.
	Research and Business Lead (Co-Founder) Sapphire: AI Trader \| India August 2023 - Present • Developed a novel ensemble trading agent using multiple SOTA reinforcement learning algorithms combined with LLMs on stock market data, achieving a 20% increase in annualized returns and a 15% reduction in volatility. • Directed an 8-member team under a Board of Advisors in parallel for algorithm development and interface deployment; achieving a 30% growth in net positive independent portfolio over 3 months. • Top 4 teams out of 150,000 entrepreneurs in the STPI [Startup India Finals] – Neuron, Mohali OCW 7.0 • Raised a research grant with one of the highest valuations from NewGen IEDC, IIIT-Allahabad (Incubator) and Xartup
	ML Research Intern Crecientech in association with University of Missouri-Columbia (Profs. Dong Xu & Gyan Srivastava) \| Remote (Boston, Massachusetts) May 2024 - July 2024 • Engineered a fine‑tuned BioBERT model by curating ~500 Q&A pairs from 30 PubMed papers—boosting structured‑format adherence by 45% and factual accuracy by 30%. • Developed an Entrez‑powered citation module using NCBI E‑utilities to fetch top n PubMed results and rank via review‑weighting + 0.85ⁿ decay—achieving 88% relevancy precision. • Built a high‑throughput backend orchestrator handling 50 queries/sec, merging LLM answers with PMID/DOI links in < 3s.
	Quantitative Research Consultant Intern WorldQuant \| Remote (Mumbai, India) January 2024 - May 2024 • Developed over 150 predictive alphas to forecast market trends in the USA and China, using comprehensive price-volume and sentiment datasets; delivered alpha submissions with Sharpe ratios under 3.45 and fitness above 1.2. Certificate

Publications

Target and Biomarker Exploration Portal for Drug Discovery research
Bhupesh Dewangan, Debjyoti Ray, Sameera Devulapalli, Yijie Ren, Shraddha Srivastava, Shilpi Chaurasia, Lei Jiang, Muneendra Ojha , Dong Xu, Gyan Srivastava
BioInformatics, 2025 (Under Review)
paper / Live version of the tool / Team / Tutorial videos

A comprehensive knowledge graph-based portal for drug discovery research focusing on biomarker exploration and target identification, integrating data from multiple biomedical databases.

Deep Q-Snake: An Intelligent Agent Mastering the Snake Game with Deep Reinforcement Learning
Debjyoti Ray; Arindam Ghosh; Muneendra Ojha ; Krishna Pratap Singh
IEEE TENCON-2024
paper / code / certificate / Demo video

This paper explores the application of Deep Q-Learning to create an intelligent agent capable of mastering the classic Snake game, demonstrating the effectiveness of reinforcement learning in game environments.

Currently Working On

TAU-KG Knowledge-Graph Pipeline
Under guidance of Dr. Gyan Srivastava & AstraZeneca | Feb 2025 – Present

• Processed 4 full-text PDFs (≈240 total 256-token chunks) and embedded each chunk into a FAISS index (sub-100 ms lookup); migrating to Weaviate and ChromaDB for richer metadata support and horizontal scaling.
• Retrieved top-5 similar chunks per entity (avg. cosine ≥ 0.83) and fed them to a fine-tuned GPT-4o model to extract 150+ entity–relation triples into a standardized JSON schema.
• Computed Pearson correlations over 1,200+ node pairs, applied Louvain community detection to identify 5 biologically coherent clusters (pathways/diseases), and validated cluster significance (p < 0.05) across 300+ tests.
• Calculated degree, betweenness, and modularity metrics for 200+ nodes; visualized insights in Streamlit; planning Neo4j migration to support >10 K edges in production.
Streamlit Demo / Hosted API / GitHub / Documentation and report [to be out soon]

Second-Brain AI Assistant
Self-Directed | March 2025 – Present (Stage 2)

• Currently implementing Stage 2 data-engine pipelines: orchestrating a Python ETL workflow to ingest Notion snapshots and crawled web resources, apply heuristic and LLM-based quality scoring, and store processed documents in MongoDB for downstream retrieval.
• Leveraging ZenML for reproducible pipeline orchestration and Opik to benchmark RAG feature extraction and retrieval efficacy.
• Will fine-tune a Llama 3.1 8B summarization model in Stage 4 using Unsloth-managed experiments tracked via Comet, to generate concise, high-fidelity knowledge digests.
• Will evolve the system in Stage 6 by integrating agentic RAG inference with Smolagents and building an interactive online interface—roadmap to adapt based on user feedback and emerging best practices.

Projects

	Amazon ML Challenge 2025: Multimodal Pricing Engine October 2025 • Architected a multimodal price prediction system using cross-attention fusion of image (SigLIP-giant, 768-dim) and text (Gemma-300M, 384-dim) embeddings, replacing naive concatenation with dual-pathway attention mechanism (8 heads) for context-aware feature interaction. • Engineered a 3-layer MLP regression head with BatchNorm1d, GELU activation, and 25% dropout, optimized via custom SMAPE loss and AdamW with cosine annealing scheduler to directly align training with competition metrics. • Achieved top 20 ranking out of 80,000+ participating teams through systematic architecture design balancing model capacity, regularization, and optimization stability. Report
	Amazon ML Challenge 2024: Feature Extraction from Images September 2024 • Processed 260,000+ images using PaddleOCR and Qwen-2VL-2B, fine-tuning the latter on 50,000 images by freezing non-critical layers and applying LoRA adapters with 4-bit quantization and FP16 precision. Evaluated performance based on multi-language accuracy and text orientation handling; ranking us 12th out of 75,000+ participating teams. • Utilized A100 and T4 GPUs for parallel execution, employing Liger kernels, gradient accumulation, and cosine learning rate scheduling to maximize throughput while minimizing computational overhead. Report
	StreamShop: Real-Time Object Detection September 2024 • Composed a few-shot dataset by overlaying 5–10 Amazon Berkeley product cutouts per Panchayat scene—yielding thousands of unique training samples • Augmented data with 4 methods (rotation, scaling, translation, color jitter) to quadruple scene diversity and bolster model generalizability • Fine-tuned the DeVIT few-shot ViT (shot = 10) via meta-learning on custom pipelines (61% Python) for real-time, low-data object detection • Co-ordinated with team to integrate a Flask (Python) backend & Next.js (TypeScript) frontend prototype to serve model inference seamlessly in Prime Video streams Demo Video / GitHub

Hackathons

	Start-a-thon 2023 Startup India National Competition • Researched and architected an ensemble trading agent blending PPO, A2C, and SAC DRL algorithms—achieving a 20% higher annualized return compared to individual agents • Implemented dynamic risk management with drawdown caps and volatility scaling—delivering a 1.5 Sharpe ratio and 30% lower max drawdown • Conducted rigorous backtesting (2010–2021 training; 2021–2024 validation), validating a consistent 25% alpha over DJI and S&P 500 benchmarks • Deployed in live markets since 2024—sustaining 99% uptime and outperforming major indices by 15% in NSE execution
	Adobe Gensolve 2024 Top 50 Nationwide • Analyzed 500+ hand‑drawn curve samples, extracting 10+ geometric features (e.g., curvature, length, closure) to drive algorithm selection and parameter tuning • Regularized shapes by implementing detection and fitting for 5 curve classes (lines, circles, ellipses, rectangles, polygons), achieving 96% classification accuracy on a 1,000‑curve test set • Inpainted 150+ occluded curves with a partial‑convolution network in partialconv/, reaching a mean IoU of 0.88, and deployed the end‑to‑end pipeline as a Streamlit app for real‑time demonstration Certificate
	Google GEN-AI Hackathon 2024 Top 200 Teams Globally • Built RESTful endpoints in Flask to orchestrate generative AI tasks via Google Gemini‑Pro, embedding a safety layer to guarantee valid JSON outputs across all routes • Built a modular Next.js/React frontend with real‑time AI response streaming, responsive Tailwind layouts, and React hook state management—achieving sub‑1 s update latency for seamless user interaction • Deployed end‑to‑end CI/CD on Vercel with custom middleware (rate‑limiting + CORS) to ensure secure, scalable inference services during the hackathon showcase

Research Summits

All India Research Scholar Summit (AIRSS) 2024
IIT Madras - Best Product Winners
Oral Presentation / Certificate

• Engaged with 5+ venture capitalists to validate Sapphire's strategic vision and incorporate direct investor feedback
• Presented our AI-trading framework to Ashwani Bhatia (SEBI Board Director), securing regulatory insights and endorsement of our risk-management protocols
• Delivered a 20-minute pitch to a panel of 15+ academics, industry leaders, and regulators, spotlighting live trading performance and DRL innovations
• Clinched the Best Product Showcase award out of 200+ national entries, distinguishing Sapphire as India's leading AI-powered trading solution

Exams

JEE & KVPY

JEE MAINS 2022: 99.28 percentile
WBJEE 2022: 99.72 percentile
JEE ADVANCED 2022: 98.02 percentile
KVPY SA 2021: Qualified