Concept graph
Short definitions with deeper context and cross-links to sibling terms.
Product
A reward model is a learned function that scores model outputs by how well they match human preferences. It is a core generative-AI concept used across modeling, product, and governance discussions.
Training
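RLHF (reinforcement learning from human feedback) aligns a model to human preferences: a reward model is trained on human preference judgments, and the model is then tuned with reinforcement-style optimization to produce outputs that the reward model scores highly.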
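To make the link between the reward model and human preferences concrete, here is a minimal sketch of a pairwise preference loss. It assumes a Bradley-Terry style objective, which is a common choice but is not specified by this glossary. The function name `preference_loss` and the example scores are hypothetical.

```python
import math


def preference_loss(reward_chosen: float, reward_rejected: float) -> float:
    """Pairwise loss -log(sigmoid(r_chosen - r_rejected)).

    Assumption: a Bradley-Terry style objective over one human comparison,
    where `reward_chosen` is the reward model's score for the preferred
    response and `reward_rejected` is its score for the other response.
    """
    margin = reward_chosen - reward_rejected
    # -log(sigmoid(m)) == log(1 + exp(-m)); branch to avoid overflow in exp().
    if margin >= 0:
        return math.log1p(math.exp(-margin))
    return -margin + math.log1p(math.exp(margin))


# Hypothetical scores: the loss is small when the preferred response
# is scored higher, and large when the ordering is reversed.
agrees = preference_loss(2.0, 0.0)
disagrees = preference_loss(0.0, 2.0)
```

Minimizing this loss over many comparisons pushes the reward model to score preferred responses higher; the reinforcement-style step then uses those scores as its training signal.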