- A public preview was announced for the integration of Fireworks AI into Microsoft Foundry, which provides open-model inference through Azure.
- Fireworks AI’s inference engine is described as processing more than 13 trillion tokens daily and sustaining about 180,000 requests per second.
- Output speed is described as exceeding 1,000 tokens per second on large models.
Disclaimer: This news brief was created by Public Technologies (PUBT) using generative artificial intelligence. While PUBT strives to provide accurate and timely information, this AI-generated content is for informational purposes only and should not be interpreted as financial, investment, or legal advice. Microsoft Corporation published the original content used to generate this news brief on March 11, 2026, and is solely responsible for the information contained therein.