Writing & Insights

Why Human-in-the-Loop Architecture is the Creative Core of Agentic AI Systems

February 2026 · 8 min read

In agentic AI and LLM systems, hallucination is one of the biggest structural challenges. This article explores how human-in-the-loop design can become a creative and robust answer to it in production-grade AI architectures.

Agentic AI · Human-in-the-Loop · LLM Systems

The Most Important Component of RAG: Building and Evaluating a Retriever

March 2026 · 20 min read

A practical guide to building a baseline retriever for RAG systems. Learn how to implement chunking strategies, generate embeddings, perform vector search using FAISS, and evaluate retrieval performance using Precision@K, Recall@K, and MRR.

RAG · Retriever · Vector Search · LLM Systems

BM25 vs Dense vs Hybrid Retrieval for RAG: Architecture, Trade-offs, and When to Use What

March 2026 · 8 min read

Learn how BM25, dense retrieval, and hybrid retrieval work in RAG systems, covering architecture, ranking strategies, and when to use each approach in production pipelines.

RAG · BM25 · Dense Retrieval · Hybrid Retrieval · Vector Search · Reranker · Information Retrieval

Improving BM25 Retrieval with Query Expansion: A Practical Evaluation Using Precision, Recall, and MRR

March 2026 · 6 min read

An experimental analysis of query expansion techniques for BM25, demonstrating measurable improvements in recall, precision, and MRR.

RAG · BM25 · Query Expansion · LLM · Information Retrieval · Search Systems

Tree of Thoughts (ToT): Improving Reasoning in RAG and Agentic AI Systems

March 2026 · 5 min read

Improve LLM reasoning using Tree of Thoughts (ToT). Learn why Chain-of-Thought fails, how multi-path reasoning works, and how to implement it in real-world RAG and agentic AI systems.

Tree of Thoughts · LLM Reasoning · Agentic AI · RAG · Prompt Engineering · Beam Search · Multi-Path Reasoning · Gemini API

From Research to Revenue: Applying Google’s Small Model Strategy to Build an Intent-Aware Lead Scoring System

April 2026 · 5 min read

Learn how to build an intent-aware lead scoring system using Google’s small model strategy. Combine clickstream data, LLM pipelines, and hybrid system design to prioritize high-conversion leads and drive real business impact.

Lead Scoring · Intent Detection · Small Models · LLM Systems · Clickstream Data · Agentic AI · Human-in-the-Loop · AI in Sales

From Query to Action: Inside an MCP-Powered Agentic AI System

April 2026 · 6 min read

Learn how MCP (Model Context Protocol) powers agentic AI systems. Explore architecture, tool orchestration, guardrails, and multi-agent design to build reliable, production-ready AI systems.

MCP · Agentic AI · LLM Systems · AI Architecture · Tool Calling · Multi-Agent Systems · AI Engineering · Production AI

Agentic AI Systems Fail Silently: Here’s How to Monitor Them in Production

May 2026 · 7 min read

Agentic AI systems often fail silently in production. Learn how to monitor LLM pipelines, detect failures early, implement observability, and build reliable AI systems at scale.

Agentic AI · LLM Observability · AI Monitoring · Production AI · AI Systems · AI Engineering · LLM Pipelines · AI Reliability

Lost in the Middle: Why LLMs Forget Critical Information in Long Contexts

May 2026 · 10 min read

Learn what the "lost in the middle" problem is in LLMs, why models ignore important information placed in the middle of long contexts, how to detect the issue in production, and practical techniques for solving it in RAG and agentic AI systems.

LLMs · Long Context · RAG · Agentic AI · Prompt Engineering · Transformer Models · AI Systems · Production AI · Context Windows

Before You Build RAG, Agents, or Fine-Tune: 10 Questions Every AI Engineer Should Ask

June 2026 · 12 min read

Learn a practical 10-question framework for choosing the right AI architecture. Discover when to use deterministic systems, prompt-only LLMs, RAG, Agentic RAG, multi-agent systems, and Human-in-the-Loop workflows while avoiding unnecessary production complexity.

AI Architecture · System Design · RAG · Agentic AI · Multi-Agent Systems · LLMs · Human-in-the-Loop · Production AI · AI Engineering