Chinese AI startup developing AI models and audio synthesis technology (StepAudio TTS).
$2.5BStrategic
Investor undisclosed
A $2.5B strategic round from hardware/telecom players (ZTE, OmniVision, Longcheer) signals Chinese AI infrastructure is consolidating around vertically-integrated stacks—these investors need embedded audio/voice models for devices, not just API access. If you're building voice or multimodal products, watch whether StepFun's TTS becomes the default in Chinese consumer hardware; that's a distribution moat Western startups don't have.