Building a production RAG pipeline for chemical patents
Lessons from shipping semantic search over 10M+ chemical patents — chunking strategy, embedding choices, and retrieval quality.
AI Engineer · Bengaluru
Currently at Simreka engineering a chemical process SaaS. Interested in edge AI, biosensors, and products with real technical depth.
Building Process Experiment V2, a chemical process engineering SaaS platform for simulation, life cycle assessment (LCA), and patent intelligence.
Biosensor wearable + LLM for emotional intelligence
End-to-end platform: BLE signal acquisition from HRV/GSR sensors, edge processing, and LLM-powered cognitive/emotional state interpretation. Designed with a data moat strategy using event sourcing.
Life Cycle Assessment for chemical process simulation
Full-stack LCA reporting system inside Simreka's Process Experiment platform. Includes 11 database migrations, a repository pattern, Eloquent models, and a Pinia store with simulation callback routing.
Semantic search over 10M+ chemical patents
Production RAG pipeline using LangChain orchestration and Milvus vector database. Handles chunking, embedding, and retrieval for chemical process engineering knowledge queries.
AI-assisted multi-language codebase navigation
Structured development workflow using CLAUDE.md context files, .claudeignore, atomic task decomposition, and session extraction prompts for navigating large TypeScript + Python codebases.
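As a rough illustration of the chunking and retrieval steps named in the semantic search project above, here is a minimal, dependency-free sketch. It does not use LangChain or Milvus: the token-count `embed` function is a toy stand-in for a real embedding model, the in-memory scan stands in for a vector database, and every function name and parameter (`chunk_text`, `retrieve`, `size`, `overlap`, `k`) is hypothetical rather than taken from the production pipeline.

```python
import math
import re
from collections import Counter


def chunk_text(text: str, size: int = 40, overlap: int = 10) -> list[str]:
    """Split text into windows of `size` words that overlap by `overlap` words."""
    if not 0 <= overlap < size:
        raise ValueError("overlap must satisfy 0 <= overlap < size")
    words = text.split()
    step = size - overlap
    chunks = []
    for start in range(0, len(words), step):
        chunks.append(" ".join(words[start:start + size]))
        if start + size >= len(words):
            break  # the last window already reached the end of the text
    return chunks


def embed(text: str) -> Counter:
    """Toy stand-in for an embedding model (assumption): lowercase token counts."""
    return Counter(re.findall(r"[a-z0-9]+", text.lower()))


def cosine(a: Counter, b: Counter) -> float:
    """Cosine similarity between two sparse token-count vectors."""
    dot = sum(a[t] * b[t] for t in a.keys() & b.keys())
    norm_a = math.sqrt(sum(v * v for v in a.values()))
    norm_b = math.sqrt(sum(v * v for v in b.values()))
    return dot / (norm_a * norm_b) if norm_a and norm_b else 0.0


def retrieve(query: str, chunks: list[str], k: int = 3) -> list[tuple[float, str]]:
    """Score every chunk against the query and return the k best matches."""
    q = embed(query)
    scored = [(cosine(q, embed(c)), c) for c in chunks]
    return sorted(scored, key=lambda pair: pair[0], reverse=True)[:k]
```

Overlapping windows are one common way to avoid cutting a claim or reaction description in half at a chunk boundary; the right window size for patent text is an empirical question that this sketch does not answer.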
Why event sourcing makes sense when your data is physiological, time-series, and needs to power a future AI moat.
A subtle stale watcher was setting sessionExpired after a successful refresh. Here's how I tracked it down.
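To make the event sourcing idea from the biosensor post above concrete, here is a minimal sketch assuming an in-memory, append-only log of sensor readings. The class and field names (`SensorEvent`, `EventStore`, `mean_by_kind`) and the event kinds are hypothetical illustrations, not the platform's actual schema. The point it shows is that raw readings are never overwritten, so any derived view can be recomputed later by replaying the log.

```python
from dataclasses import dataclass
from statistics import mean
from typing import Optional


@dataclass(frozen=True)
class SensorEvent:
    """One immutable reading; `kind` values such as "hrv" are hypothetical."""
    timestamp_ms: int
    kind: str
    value: float


class EventStore:
    """Append-only in-memory log: events are added, never updated or deleted."""

    def __init__(self) -> None:
        self._events: list[SensorEvent] = []

    def append(self, event: SensorEvent) -> None:
        self._events.append(event)

    def replay(self, until_ms: Optional[int] = None) -> list[SensorEvent]:
        """Return events in timestamp order, optionally cut off at `until_ms`."""
        ordered = sorted(self._events, key=lambda e: e.timestamp_ms)
        if until_ms is None:
            return ordered
        return [e for e in ordered if e.timestamp_ms <= until_ms]


def mean_by_kind(events: list[SensorEvent]) -> dict[str, float]:
    """Derive a view (mean value per sensor kind) purely from the event list."""
    grouped: dict[str, list[float]] = {}
    for e in events:
        grouped.setdefault(e.kind, []).append(e.value)
    return {kind: mean(values) for kind, values in grouped.items()}
```

Because the view is a pure function of the log, a new feature or model input can be backfilled over historical data by replaying from the start, which is the property that makes the raw log valuable over time.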
Open to AI engineering roles — ideally working on LLM systems, RAG infrastructure, or products with real technical depth. Also happy to talk about MindPattern, biosensors, or edge AI.
Actively looking · Bengaluru / Remote