Product
data contamination
data contamination is a core generative-AI concept used across modeling, product, and governance discussions.
Expanded definition
data contamination shows up constantly when teams ship LLM features. Practically, it influences how you design prompts, evaluate quality, and reason about failure modes. Teams should document how data contamination manifests in their stack—data handling, evaluation, and runtime guardrails—and revisit assumptions as models update.
Related terms
Explore adjacent ideas in the knowledge graph.