Token Usage Tracking Enhancements #
- Per-Model Token Tracking: Improved token usage tracking with breakdown by model instead of aggregated totals
- New display format shows token usage for each model separately
- Format:
Model Name: X input, Y output, Z cache read, W cache write - Enhanced Cache Metrics: Better tracking of cache read and cache creation tokens
- Added functions to separately track cache read input tokens and cache creation input tokens
- More accurate cache usage reporting in API responses