Time-to-First-Token Latency Reduction