
San Francisco - Sep 05, 2025 (UTC) - Baseten, the company powering inference for the world’s fastest-growing AI products, today announced a $150 million Series D funding round at a $2.15 billion valuation. The round was led by BOND, with participation from new investors CapitalG (Alphabet’s growth fund), Premji Invest, and Kevin and Elizabeth Weil of Scribble Ventures. Existing investors including Conviction, 01A, IVP, Spark Capital, Greylock Partners, and BoxGroup also participated in the round.
The funding comes just six months after Baseten’s $75 million Series C round, nearly tripling the company’s valuation from $750 million. This rapid growth reflects the surging demand for AI inference infrastructure as companies increasingly deploy AI models in production applications.
Baseten offers a software platform that helps enterprises accelerate their AI inference workloads, promising up to 50% faster performance than competing products. The company’s technology can be deployed on customers’ own infrastructure or through a cloud-based version that runs across 10 different infrastructure-as-a-service platforms. When traffic increases, Baseten automatically provisions additional graphics cards on the most suitable platform and can switch between cloud providers if technical issues arise.
The platform uses topology-aware parallelism to optimize infrastructure by reducing data traffic between graphics cards, and employs operator fusion to combine separate calculations into single computations for improved efficiency. The company also provides AI model quantization tools that shrink neural networks to reduce memory requirements.
Beyond core inference capabilities, Baseten offers developer tools for automating AI model deployment and built-in observability features for monitoring performance metrics including request volumes, response times, and hardware usage. In May 2025, the company expanded beyond inference by launching an AI training service that provides infrastructure for building new AI models.
“Every breakout AI application depends on fast, reliable, and cost-effective inference, the same way the last 15 years of companies depended on the cloud,” said Tuhin Srivastava, co-founder and CEO of Baseten.
With the Series D funding, Baseten will scale its team and product to meet accelerating customer demand. The company plans to double down on investing in model performance research, infrastructure, and developer tooling while expanding its customer teams. The funding will also support research into new methods of speeding up AI models and expanding the company’s lineup of developer tools.
Jay Simons from BOND will join Baseten’s board of directors as part of the investment. The round also welcomed Sarah Guo from Conviction and Jill Chase from CapitalG to support the company’s growth trajectory.
The funding announcement underscores the critical importance of inference infrastructure in the AI ecosystem, as companies move beyond experimentation to deploying AI models at scale in production environments. Baseten’s rapid valuation growth reflects investors’ confidence in the company’s position as a leading platform for AI inference infrastructure.
About Baseten
Baseten is the premier inference platform powering AI applications for the world’s fastest-growing companies. The platform provides fast, reliable, and cost-effective AI model serving with advanced optimization techniques including topology-aware parallelism and operator fusion. Baseten supports both cloud-hosted and on-premises deployments across multiple infrastructure providers, enabling automatic scaling and failover capabilities. The company also offers comprehensive developer tools for model deployment automation and performance monitoring, along with AI training services for building custom models.