Anton Glenbovitch

20+ Years Building Enterprise Systems

Backend architecture, enterprise applications, data workflows, integrations, and AI-enabled engineering systems. Experience across university IT, health insurance, and independent consulting.

Core Expertise

AI Systems: RAG workflows, LLM evaluation, prompt/control-plane design, agentic orchestration
Backend Engineering: Python, FastAPI, Java, REST APIs, SQL, enterprise integrations
Cloud / DevOps: AWS Lambda, Bedrock, OpenSearch, Docker, CI/CD, observability
System Design: reliability, testing, guardrails, monitoring, cost-aware architecture

Engineering Philosophy

In production AI systems, the main challenges lie not in model selection but in:

Data quality and preparation
Retrieval accuracy and ranking
System design and integration
Evaluation and continuous monitoring

Reliable systems require guardrails, evaluation metrics, and rigorous testing to control hallucinations and maintain consistency at scale.

Featured Projects

Selected engineering projects showing RAG architecture, evaluation workflows, cloud patterns, guardrails, and backend system design.

Liquid in Glass Particle Simulation

2026 • Interactive Canvas • Portfolio Project

JavaScript • Canvas API • Particle Simulation • Responsive UI

Overview: Browser-native fluid experiment that simulates layered liquids inside a glass with real-time particle physics, tilt/shake/swirl interactions, adjustable particle density, dynamic color blending, and an optional 3D depth mode.

Rendering: HTML5 Canvas + typed arrays

Simulation: Spatial hash particle solver

Interaction: Pointer + keyboard controls
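
As an illustration of the spatial hash idea named above, here is a minimal sketch in Python rather than the project's JavaScript. It is not the project's solver: the function names `build_grid` and `neighbors`, the 2D tuple layout, and the requirement that the search radius not exceed the cell size are all assumptions made for this example. The point is only that bucketing particles by grid cell lets each particle check the 3x3 block of cells around it instead of every other particle.

```python
from collections import defaultdict
from math import floor, hypot


def build_grid(points, cell_size):
    """Bucket particle indices by the integer grid cell that contains them."""
    grid = defaultdict(list)
    for i, (x, y) in enumerate(points):
        grid[(floor(x / cell_size), floor(y / cell_size))].append(i)
    return grid


def neighbors(points, grid, cell_size, i, radius):
    """Indices of particles within `radius` of particle i.

    Assumes radius <= cell_size, so only the 3x3 block of cells
    around particle i can contain a neighbor.
    """
    x, y = points[i]
    cx, cy = floor(x / cell_size), floor(y / cell_size)
    found = []
    for dx in (-1, 0, 1):
        for dy in (-1, 0, 1):
            # .get avoids creating empty buckets in the defaultdict
            for j in grid.get((cx + dx, cy + dy), ()):
                if j != i and hypot(points[j][0] - x, points[j][1] - y) <= radius:
                    found.append(j)
    return sorted(found)


# Hypothetical particle positions, for illustration only
particles = [(0.1, 0.1), (0.4, 0.2), (5.0, 5.0), (0.9, 1.1)]
grid = build_grid(particles, cell_size=1.0)
print(neighbors(particles, grid, 1.0, 0, radius=1.0))
```

In a real-time simulation the grid would typically be rebuilt each frame, since particles move between cells.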

Launch Interactive Demo

Enterprise Claim AI Platform

2024–2025 • Health Insurance Reference Architecture • Portfolio Project

RAG • LangChain • Pinecone • GPT-4 • AWS Lambda • Python

Overview: Reference architecture for AI-assisted insurance claim analysis using RAG, workflow orchestration, fraud-risk scoring, audit logging, and human review patterns. Designed to demonstrate how an enterprise claims workflow could combine retrieval, LLM reasoning, evaluation, and governance.

Architecture: Event-driven AWS workflow

AI Layer: RAG + Bedrock/Claude reasoning

Governance: Audit logs + evaluation hooks

Deployment: Terraform + FastAPI service pattern

Key Technical Decisions

Hybrid Retrieval: Combined semantic search with BM25 ranking for an 8% accuracy improvement over a semantic-only approach
Evaluation Framework: Automated metrics (ROUGE, BERTScore) + human QA labels for ground truth validation
Cost Optimization: Prompt caching (30% reduction), batch processing, cheaper embedding models
Guardrails: Hallucination detection, conservative "I don't know" responses for out-of-domain queries
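
One common way to combine a semantic ranking with a BM25 ranking is reciprocal rank fusion (RRF). The sketch below is an assumption for illustration, not necessarily the fusion method used in this project: the function name, the document IDs, and the constant `k=60` (a conventional default for RRF) are all hypothetical choices.

```python
def reciprocal_rank_fusion(rankings, k=60):
    """Fuse several ranked lists of document IDs into one ranking.

    Each document scores 1 / (k + rank) for every list it appears in;
    documents ranked well by more than one retriever rise to the top.
    """
    scores = {}
    for ranking in rankings:
        for rank, doc_id in enumerate(ranking, start=1):
            scores[doc_id] = scores.get(doc_id, 0.0) + 1.0 / (k + rank)
    # Sort by descending score; break ties by ID for a stable order
    return sorted(scores, key=lambda d: (-scores[d], d))


# Hypothetical results from the two retrievers
semantic_hits = ["a", "b", "c"]
bm25_hits = ["b", "d", "a"]
print(reciprocal_rank_fusion([semantic_hits, bm25_hits]))
```

Because RRF works on ranks rather than raw scores, it avoids having to normalize cosine similarities against BM25 scores, which live on different scales.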

Impact

Demonstrates how to structure claim-analysis workflows with retrieval grounding, model routing, evaluation checks, human fallback, and auditable decision metadata.
Shows a practical cloud pattern for regulated AI systems where traceability, conservative responses, and escalation matter as much as model output.
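
The conservative-response and escalation pattern described above can be sketched as a simple routing rule: answer only when retrieval found enough strong supporting passages, otherwise hand off to a human reviewer. Everything here is hypothetical and chosen for illustration, including the `Decision` type, the `route` function, and the `min_score` and `min_hits` thresholds; it is not the project's actual guardrail logic.

```python
from dataclasses import dataclass


@dataclass
class Decision:
    action: str  # "answer" or "escalate"
    reason: str  # recorded alongside the decision for auditability


def route(retrieval_scores, min_score=0.75, min_hits=2):
    """Answer only if at least `min_hits` passages score >= `min_score`."""
    strong = [s for s in retrieval_scores if s >= min_score]
    if len(strong) < min_hits:
        return Decision(
            "escalate",
            f"only {len(strong)} passage(s) at or above {min_score}",
        )
    return Decision("answer", f"{len(strong)} supporting passages")


# Hypothetical similarity scores for two queries
print(route([0.91, 0.82, 0.30]))
print(route([0.91, 0.40]))
```

Keeping the reason string next to the action gives reviewers and audit logs a record of why a query was answered or escalated.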

Read Full Project Writeup • View on GitHub

RAG Evaluation Framework

2024 • Research & Production • Open Source

Evaluation • LLM Metrics • Python • Pytest

Overview: Comprehensive evaluation framework for assessing RAG pipeline quality. Combines automated metrics with human-in-the-loop validation to measure retrieval accuracy, generation quality, and hallucination rates.

Metrics Supported: 12+ (ROUGE, BERTScore, custom)

Hallucination Detection: Automated + human labeling

Benchmark Datasets: SQuAD, Natural Questions, custom
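
To make the ROUGE family of metrics concrete, here is a simplified ROUGE-1 (unigram overlap) in plain Python. It is a sketch under stated assumptions, not the framework's code: the function name `rouge1` and the lowercase whitespace tokenization are illustrative choices, and production ROUGE implementations usually add stemming and more careful tokenization.

```python
from collections import Counter


def rouge1(candidate, reference):
    """Unigram-overlap precision, recall, and F1 between two strings."""
    cand = Counter(candidate.lower().split())
    ref = Counter(reference.lower().split())
    # Counter intersection keeps the minimum count of each shared token
    overlap = sum((cand & ref).values())
    if overlap == 0:
        return {"precision": 0.0, "recall": 0.0, "f1": 0.0}
    precision = overlap / sum(cand.values())
    recall = overlap / sum(ref.values())
    f1 = 2 * precision * recall / (precision + recall)
    return {"precision": precision, "recall": recall, "f1": f1}


# Hypothetical generated answer and reference answer
print(rouge1("the claim was approved", "the claim was denied yesterday"))
```

The example also shows the metric's blind spot: "approved" versus "denied" reverses the meaning, yet most unigrams still overlap, which is one reason to pair automated metrics with human labels.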

Read Full Writeup • View on GitHub

Health Insurance Member Q&A Chatbot

2025 • Full-Stack • Healthcare Domain

React • Node.js • RAG • AWS • TypeScript

Overview: Full-stack conversational AI system that helps health insurance members get answers to questions about coverage, claims, and benefits. Combines a React frontend, a Node.js backend, and a RAG pipeline for accurate, compliant responses.

Response Accuracy: 92%

User Satisfaction: 4.2/5.0

Hallucination Rate: <2%

Cost: $0.04/query

Read Full Writeup • View on GitHub

Recent Articles

Technical deep dives on RAG, LLM systems, and production AI architecture.

Spec-Driven Development: Moving AI Coding from Experimentation to Production Discipline

June 2026 • 12 min read

Why AI coding needs a contract-first workflow with clear specifications, planning gates, traceable implementation, and verification before production use.

Read on Website →

Building Production RAG: Cost Optimization Strategies

April 2025 • 8 min read

How to reduce RAG pipeline costs by 66% without sacrificing quality. Covers prompt caching, embedding model selection, batch processing, and cost-per-query optimization.

Read on Medium →

Evaluating RAG Systems: Beyond Automated Metrics

March 2025 • 10 min read

Why automated metrics alone fail for RAG evaluation. The case for human-in-the-loop validation, building labeled datasets, and continuous monitoring in production.

Read on Dev.to →

Conversational AI in Healthcare: Domain-Specific Challenges

February 2025 • 12 min read

Lessons from building health insurance chatbots. PII handling, regulatory compliance, conservative response strategies, and maintaining accuracy in regulated domains.