A AI / LLM

17 articles

Learning Game

AI Concept Match Quiz

Match agent, RAG, and evaluation-operation keywords to the right explanations.

Progress

1 / 6

Score

Streak

Badge

Concept Explorer

Use keys 1-4 for quick answers.

Keyword

Agent Design

3 articles

View this group

#ai #agent #streaming #sse #websocket

AI Agent Streaming Design — Should You Use SSE or WebSocket?

In AI Agent services, user trust depends not only on the final answer but on how progress is shown during execution. This post compares SSE and WebSocket for token streaming, step status, tool execution events, and intermediate results, with practical guidance for real product teams.

TestForge Team April 19, 2026

#ai #agent #llm #service #backend

AI Agent Service Design Patterns — Tool Calling, State Management, and Guardrails

A practical guide to turning AI Agents into real services. Covers Tool Calling, Planner/Executor separation, session state management, human-in-the-loop workflows, failure handling, and cost control.

TestForge Team April 18, 2026

#ai #llm #agent #architecture #backend

AI Agent Architecture — From ReAct to Multi-Agent Systems

How to design production AI Agent systems. A practical guide covering the ReAct pattern, Tool Use, Memory management, Multi-Agent orchestration, and safety design.

TestForge Team March 13, 2026

Applied Agent Series

6 articles

View this group

#ai #rag #agent #architecture #investment

RAG-Based AI Stock Investment Agent Part 1 — Requirements and Overall Architecture

A practical blueprint for a RAG-based AI stock investment Agent. Covers product goals, user scenarios, system boundaries, core components, and end-to-end architecture for a research and paper-trading workflow.

TestForge Team April 18, 2026

#ai #rag #investment #data #search

RAG-Based AI Stock Investment Agent Part 2 — Building a Market Data, News, and Filing Knowledge Base

A practical guide to building the RAG data layer for an AI stock investment Agent. Covers price data, news, SEC filings, earnings transcripts, normalization, chunking, metadata, and freshness-aware retrieval.

TestForge Team April 18, 2026

#ai #agent #rag #investment #backend

RAG-Based AI Stock Investment Agent Part 3 — Agent Workflow, Tool Calling, and Analysis Chains

A practical design for the workflow of an AI stock investment Agent. Covers routing, query parsing, screening, retrieval analysis, quantitative analysis, risk evaluation, and final report composition.

TestForge Team April 18, 2026

#ai #investment #risk #backtest #architecture

RAG-Based AI Stock Investment Agent Part 4 — Portfolio Construction, Risk Rules, and Backtesting

Strong stock analysis is not enough to build a real investment Agent. This post explains position sizing, sector concentration limits, event risk, backtesting design, and paper-trading workflows.

TestForge Team April 18, 2026

#ai #fastapi #rag #backend #investment

RAG-Based AI Stock Investment Agent Part 5 — FastAPI, PostgreSQL, and pgvector System Design

A practical implementation blueprint for a RAG-based stock investment Agent using FastAPI, PostgreSQL, pgvector, Redis, async workers, and domain-separated service modules.

TestForge Team April 18, 2026

#ai #investment #operations #agent #monitoring

RAG-Based AI Stock Investment Agent Part 6 — Paper Trading, Monitoring, and Operational Guardrails

A practical operations guide for a stock investment Agent. Covers paper-trading workflow, human approval, monitoring, alerts, audit logs, failure handling, and the guardrails needed before any real execution.

TestForge Team April 18, 2026

Inference and Operations

2 articles

View this group

#fastapi #ai #llm #python #backend

Building an AI Inference Server with FastAPI — Production LLM Serving Guide

How to build a production-grade AI model inference server with FastAPI and uvicorn. Covers async processing, batch inference, GPU utilization, and Kubernetes deployment.

TestForge Team April 1, 2026

#llm #ai #operations #backend #cost

Operating LLM Services in Production — Stability Guide for AI Applications

How to reliably operate LLM-based services in production. Covers cost management, latency optimization, incident response, and monitoring — all from real-world experience.

TestForge Team March 11, 2026

RAG Engineering

6 articles

View this group

#ai #rag #llm #architecture #search

RAG Architecture Design Guide — From Retrieval Quality to Answer Generation

A practical guide to designing RAG systems. Covers document ingestion, chunking, embeddings, vector search, reranking, prompt composition, and evaluation from a real product engineering perspective.

TestForge Team April 18, 2026

#ai #rag #llm #data #architecture

RAG Development Part 1 — Document Ingestion and Data Cleaning Pipeline Design

RAG quality starts with data, not the model. This post explains how to choose source documents, clean HTML/PDF/wiki data, attach metadata, and build a production-ready ingestion pipeline.

TestForge Team April 18, 2026

#ai #rag #embedding #search #llm

RAG Development Part 2 — Chunking and Embedding Strategy for Better Retrieval

Chunking and embeddings define the floor of retrieval quality. This post covers chunk size, overlap, heading preservation, code block handling, embedding model selection, and indexing strategy.

TestForge Team April 18, 2026

#ai #rag #search #retrieval #llm

RAG Development Part 3 — Retrieval, Hybrid Search, and Reranking

Search quality largely defines RAG quality. This post explains dense retrieval, BM25, hybrid search, query rewriting, metadata filtering, and reranking from a practical engineering perspective.

TestForge Team April 18, 2026

#ai #rag #prompt #llm #service

RAG Development Part 4 — Answer Generation, Prompt Design, and Citations

Retrieval is only half of RAG. This post explains how to structure prompts, select and compress context, design citations, and make the system answer safely when evidence is weak.

TestForge Team April 18, 2026

#ai #rag #operations #evaluation #llm

RAG Development Part 5 — Evaluation, Observability, and Production Operations

To move RAG into production, you need quality evaluation, logging, latency tracking, and feedback loops. This post covers retrieval metrics, groundedness, citation accuracy, observability, and operational checklists.

TestForge Team April 18, 2026

A practical hub for operating and improving AI services

A AI / LLM

AI Concept Match Quiz

Agent Design

Applied Agent Series

Inference and Operations

RAG Engineering