Home/
Part XIII — Expert Mode: Systems, Agents, and Automation/42. Fine-Tuning vs Prompting vs Retrieval (Decision Framework)/42.3 Data requirements and dataset quality
42.3 Data requirements and dataset quality
Overview and links for this section of the guide.
On this page
Data Requirements
| Dataset Size | Expected Quality |
|---|---|
| <100 examples | Unlikely to help |
| 100-500 | Style changes possible |
| 500-5000 | Good for narrow tasks |
| 5000+ | Strong domain adaptation |
Quality Checklist
Dataset Quality Checks:
✅ Consistent format across examples
✅ No contradictory examples
✅ Diverse edge cases represented
✅ Ground truth is actually correct
✅ No PII or sensitive data
✅ Representative of production distribution