#AI

34 posts

Sunday, June 7, 2026 4 min read

LLM App Development #10: Building an AI Agent

Build an agent where Claude chooses its own tools and takes multiple steps to get work done. The agent loop, orchestrating several tools, and safe termination conditions.

AI LLM Claude

Saturday, June 6, 2026 5 min read

LLM App Development #9: Conversation Memory and Context Management

How to handle the history that piles up as a conversation grows. Context limits and token cost, sliding windows and summary compaction, and server-side compaction.

AI LLM Claude

Friday, June 5, 2026 5 min read

LLM App Development #8: Building a RAG Pipeline

Hand the documents found by vector search to Claude so it answers based on our documents. We complete RAG, building the full pipeline of chunking, retrieval, prompt injection, and generation.

AI LLM Claude

Thursday, June 4, 2026 5 min read

LLM App Development #7: Embeddings and Vector Search

Embeddings turn text into vectors that carry meaning, and vector search finds similar documents using those vectors. The preparation step for RAG in the next part.

AI LLM Claude

Wednesday, June 3, 2026 5 min read

LLM App Development #6: Connecting External Functions with Tool Calling

Letting Claude call functions you define. Tool definitions, handling tool_use, and the execution loop — the foundation for connecting Claude to external APIs and databases.

AI LLM Claude

Tuesday, June 2, 2026 5 min read

LLM App Development #5: Getting Structured Output

Going beyond narrowing output with a prompt by enforcing it with a JSON schema. With a Pydantic model and messages.parse, you receive a validated result and plug it straight into code.

AI LLM Claude

Monday, June 1, 2026 7 min read

LLM App Development #4: Prompt Engineering in Practice

The same question can yield very different answers depending on how you ask. Specific instructions, output formatting, examples, and separating structure with tags — how to write prompts that reliably draw out the result you want.

AI LLM Claude

Sunday, May 31, 2026 6 min read

LLM App Development #3: Streaming Responses in Real Time

Instead of waiting for the full response, stream it to the screen as it is generated. With messages.stream and text_stream, you sharply cut the perceived wait for the first characters to appear.

AI LLM Claude

Saturday, May 30, 2026 6 min read

LLM App Development #2: Understanding Messages and Parameters

The role structure of messages and multi-turn conversations, the system prompt, and core parameters like max_tokens and temperature. The foundation for conveying context and instructions to Claude precisely.

AI LLM Claude

Friday, May 29, 2026 7 min read

LLM App Development #1: Your First API Call and Environment Setup

What makes an LLM app different from a typical backend, and a walkthrough from issuing an Anthropic API key to getting your first response with the Python SDK. The starting point of the series.