Alibaba Cloud Leads Funding Round in ShengShu Technology to Advance Multimodal AI Models
By Cygnus | 10 Apr 2026
Summary
- ShengShu Technology has raised fresh funding in a round led by Alibaba Cloud.
- The investment will support development of advanced AI models, including multimodal systems that process video, audio, and text.
- The deal highlights continued investor interest in next-generation AI technologies beyond traditional language models.
BEIJING, April 10, 2026 — ShengShu Technology has secured new funding in a round led by Alibaba Cloud, as the company expands its work on advanced artificial intelligence models capable of handling multiple data types.
Focus on Multimodal AI
The company is developing AI systems designed to process and generate content across formats such as video, audio, and text. These systems aim to improve how machines interpret complex real-world scenarios, an area gaining attention as AI development moves beyond text-based applications.
Industry participants increasingly refer to such approaches as “multimodal” or “world-model” research, where AI systems are trained on diverse datasets to better understand context and environment.
Product and Technology Development
ShengShu is known for its work in AI-generated video, including its Vidu platform. The latest funding is expected to support further development of these capabilities, alongside broader research into scalable AI infrastructure and model training.
Companies globally are investing in similar technologies as demand rises for tools that can generate richer and more realistic digital content.
Strategic Backing
For Alibaba Cloud, the investment aligns with its broader strategy to support AI innovation while expanding demand for cloud computing services. Training and deploying advanced AI models typically requires significant computing resources, making cloud providers key participants in the ecosystem.
The funding round also reflects continued momentum in China’s AI sector, where startups and established firms are accelerating development across generative AI and related technologies.
Why this matters
- Next-Gen AI: Development is shifting from text-based models to multimodal systems capable of handling real-world data.
- Cloud Demand: AI training is driving increased demand for large-scale computing infrastructure.
- Global Competition: Investments highlight intensifying competition in advanced AI development.
FAQs
Q1. What are multimodal AI models?
They are systems that can process and generate multiple types of data, such as text, images, audio, and video.
Q2. What is ShengShu known for?
The company develops AI tools focused on video generation and related technologies.
Q3. Why is Alibaba Cloud investing in AI startups?
Cloud providers benefit from increased demand for computing resources required to train and run advanced AI models.