Results for "direct preference optimization"
Compute Governance
Intermediate
Regulating access to large-scale compute.
Fine-Tuning
Intermediate
Updating a pretrained model’s weights on task-specific data to improve performance or adapt style/behavior.
Artificial Intelligence
Intermediate
The field of building systems that perform tasks associated with human intelligence—perception, reasoning, language, planning, and decision-making—via algori...