GENAIWIKI

Tutorials

Long-form guides optimized for engineers shipping GenAI features responsibly.

15 min read

Chunking Strategies for Legal PDFs: Improving Document Retrieval

This tutorial focuses on optimizing chunking strategies for legal documents to enhance retrieval accuracy. Prerequisites include familiarity with document processing and retrieval systems.

Updated today
chunking, legal, retrieval, NLP

15 min read

Ensuring PII Handling in RAG Pipelines for Healthcare Applications

This tutorial outlines best practices for handling Personally Identifiable Information (PII) in retrieval-augmented generation (RAG) pipelines within healthcare settings. It emphasizes the importance of compliance and security measures. Prerequisites include knowledge of healthcare data regulations and RAG systems.

Updated today
RAG, healthcare, PII, data handling

12 min read

Implementing Cost Controls in RAG: Batching vs Streaming Tokens for E-commerce

This tutorial provides a comprehensive guide on implementing cost controls in retrieval-augmented generation (RAG) systems, focusing on the balance between batching and streaming tokens in e-commerce applications. It covers the implications of each approach on performance and cost. Prerequisites include familiarity with RAG systems and token management.

Updated today
RAG, e-commerce, cost control, token management

10 min read

Structured Outputs vs JSON Mode Tradeoffs in Financial Services

This tutorial explores the trade-offs between structured outputs and JSON mode in retrieval-augmented generation (RAG) systems specifically for financial services applications. It highlights how structured outputs can improve data integrity and ease of processing but may limit flexibility compared to JSON mode. Prerequisites include a basic understanding of RAG systems and their applications in finance.

Updated today
RAG, financial services, data formats, structured outputs

10 min read

Creating Runbooks When Quality Regresses Overnight

This tutorial provides a structured approach to creating effective runbooks for addressing quality regressions in machine learning models. Prerequisites include familiarity with ML deployment and monitoring processes.

Updated today
runbooks, quality regression, machine learning

14 min read

Golden-Set Design for RAG Faithfulness in Healthcare Applications

This tutorial focuses on designing golden sets for retrieval-augmented generation (RAG) systems in healthcare, so that generated responses can be checked for faithfulness and reliability. Prerequisites include an understanding of RAG systems and familiarity with healthcare data.

Updated today
golden set, RAG, healthcare applications

18 min read

Reducing Hallucinations with Citation Constraints in Academic Research Models

This tutorial outlines methods to reduce hallucinations in academic research models by implementing citation constraints. It targets researchers and developers working on language models for academic purposes. Prerequisites include familiarity with natural language processing and model training.

Updated today
hallucinations, citation constraints, academic research

12 min read

Implementing SLI/SLO for Generative Endpoints

This tutorial outlines how to define and implement Service Level Indicators (SLIs) and Service Level Objectives (SLOs) for generative endpoints, so that availability and performance can be measured against explicit targets. Prerequisites include an understanding of SLIs, SLOs, and basic API concepts.

Updated today
SLI, SLO, API, generative models

15 min read

Embedding Drift Monitoring in Production for E-commerce

This tutorial covers how to implement embedding drift monitoring in production systems specifically for e-commerce applications. It focuses on detecting shifts in user behavior and product interactions that can affect recommendation systems. Prerequisites include familiarity with machine learning models and data pipelines.

Updated today
embedding drift, e-commerce, monitoring

10 min read

Understanding Offline vs Online Evaluation Frequency

This tutorial explores the trade-offs between offline and online evaluation methods for machine learning models, focusing on their impact on performance metrics and deployment strategies. Prerequisites include familiarity with basic ML concepts and evaluation metrics.

Updated today
evaluation, machine learning, offline, online

13 min read

Embedding Drift Monitoring in Production for Healthcare Applications

This tutorial covers the implementation of embedding drift monitoring in production systems for healthcare applications, ensuring model accuracy over time. Prerequisites include knowledge of machine learning models and monitoring techniques.

Updated today
embedding drift, monitoring, healthcare, machine learning

14 min read

Implementing Shadow Traffic for Safe Model Rollouts in E-commerce

This tutorial explains how to implement shadow traffic to test new models in an e-commerce environment without affecting live traffic. Prerequisites include knowledge of machine learning deployment and monitoring practices.

Updated today
shadow traffic, model rollout, e-commerce, machine learning

15 min read

Implementing PII Handling in Retrieval Pipelines for Financial Services

This tutorial guides you through implementing PII handling in retrieval pipelines specifically tailored for financial services, ensuring compliance with regulations like GDPR and CCPA. Prerequisites include familiarity with data privacy laws and experience in building retrieval systems.

Updated today
PII, retrieval, financial services, data privacy

14 min read

Structured Outputs vs JSON Mode Tradeoffs in E-commerce

This tutorial examines the trade-offs between structured outputs and JSON mode in e-commerce applications. Prerequisites include familiarity with data formats and experience in e-commerce platforms.

Updated today
E-commerce, Data Formats, Structured Outputs, JSON

16 min read

Golden-Set Design for RAG Faithfulness in Financial Services

This tutorial discusses the design of golden sets to ensure the faithfulness of retrieval-augmented generation (RAG) systems in financial services. Prerequisites include experience with RAG systems and access to financial datasets.

Updated today
Financial Services, RAG, Golden Set, Data Quality

15 min read

Reducing Hallucinations with Citation Constraints in Academic Research

This tutorial explores how to effectively implement citation constraints to minimize hallucinations in academic research models. Prerequisites include familiarity with natural language processing (NLP) and access to a research dataset.

Updated today
NLP, Research, Hallucinations, Citation Constraints

22 min read

Embedding Drift Monitoring in Production for Financial Services

This tutorial focuses on techniques for monitoring embedding drift in production environments for financial services. Prerequisites include an understanding of machine learning embeddings and production systems.

Updated today
embedding drift, financial services, monitoring, machine learning

18 min read

Shadow Traffic for Safe Model Rollouts in E-commerce Platforms

This tutorial explains how to implement shadow traffic techniques for safely rolling out new machine learning models in e-commerce applications. Prerequisites include knowledge of machine learning deployment and A/B testing.

Updated today
shadow traffic, model rollout, e-commerce, A/B testing

20 min read

Evaluating Tool-Calling Reliability Under Load in IT Support Systems

This tutorial focuses on assessing the reliability of tool-calling mechanisms in IT support systems during peak loads. Prerequisites include familiarity with load testing and IT support workflows.

Updated today
load testing, IT support, tool reliability, performance evaluation

15 min read

Ensuring PII Handling in Retrieval Pipelines for Healthcare Applications

This tutorial covers best practices for managing Personally Identifiable Information (PII) in retrieval pipelines within healthcare settings, with a focus on compliance with regulations like HIPAA. Prerequisites include an understanding of data privacy laws and basic retrieval pipeline concepts.

Updated today
PII, healthcare, data privacy, retrieval pipelines

18 min read

Evaluating Tool-Calling Reliability Under Load in IT Support

This tutorial provides a framework for assessing the reliability of tool-calling in RAG systems under high load conditions, specifically for IT support applications. It requires knowledge of system performance metrics and load testing methodologies.

Updated today
RAG, IT Support, Load Testing, Performance Evaluation

16 min read

Ensuring PII Handling in RAG Pipelines for Legal Firms

This tutorial focuses on best practices for handling Personally Identifiable Information (PII) in RAG pipelines within legal firms. It requires knowledge of legal compliance and data protection standards.

Updated today
RAG, Legal, PII Handling, Data Protection

14 min read

Implementing Cost Controls in RAG: Batching vs Streaming Tokens in Financial Services

This tutorial explores the cost implications of batching versus streaming token usage in RAG systems for financial services. It requires familiarity with RAG tokenization and financial data processing.

Updated today
RAG, Financial Services, Cost Control, Tokenization

12 min read

Comparing Structured Outputs and JSON Mode for RAG in E-commerce

This tutorial examines the trade-offs between structured outputs and JSON mode in RAG systems tailored for e-commerce applications. It requires a basic understanding of RAG and JSON data formats.

Updated today
RAG, E-commerce, Data Structure, JSON
