#agents
5 articles
Model Context Protocol (MCP) and Tool-Use Infrastructure
How LLMs safely reach the outside world — from raw function calling to MCP, the open standard that collapses N×M bespoke integrations to N+M, with production-grade security, reliability, and a ~88% token reduction via deferred tool loading.
Design a Customer-Support AI Assistant
Architect a production customer-support AI that deflects 60–80% of tickets by combining RAG over a help center, real-action tools (refunds, cancellations, account changes), per-session memory, guardrails, and a structured handoff to a human agent — all while keeping hallucination below 2%.
Design a Realtime Voice AI Agent
Build a full-duplex voice agent that answers PSTN and WebRTC calls with sub-800ms voice-to-voice latency — covering the cascade pipeline (STT→LLM→TTS), turn detection, barge-in, telephony transport, and scaling to thousands of simultaneous calls.
Design an AI Coding Assistant (Copilot / Cursor)
Architect a system that delivers inline ghost-text completions in under 200ms and drives an autonomous agent that edits dozens of files — the two-product architecture behind GitHub Copilot, Cursor, and Sourcegraph Cody at billions of completions per day.
Design an AI Agent Platform
Build a platform that runs autonomous LLM agents — each capable of planning, calling tools, and completing multi-step tasks lasting minutes to hours — with durable state, idempotent tool execution, and per-tenant safety guardrails.