When should I use an agent.md or claude.md file instead of a skill?
Only when you have truly proprietary or company-specific information that must be present on every single turn. For most users (about 95%), that constant inclusion just wastes tokens and degrades performance.
How do skills save tokens compared with agent.md files?
Skills use progressive disclosure: only the skill name and short description are in context each turn. The agent fetches the full skill file only when needed, reducing typical per-turn token cost from thousands to a few dozen tokens.
What is the recommended process to create a reliable skill?
Identify the workflow, walk the agent step-by-step through a successful run while correcting mistakes, then ask the agent to codify that successful run into a skill file and iteratively refine it when failures occur.
How should I scale agents for real productivity?
Start with a single agent and build robust skills and workflows first. Add sub-agents only after core workflows are reliable—avoid jumping to multi-agent setups for novelty.