Fireworks.ai is a California-based synthetic intelligence (AI) startup that’s providing a singular resolution for enterprises. The AI agency doesn’t construct massive language fashions (LLMs) or basis fashions from scratch however fine-tunes open-source fashions and converts them into an Application Programming Interface (API) to assist companies deploy the AI capabilities in a seamless trend. The fine-tuning reduces the scope of the AI mannequin and focuses it on a particular performance. This permits them to cut back cases of AI hallucinations and enhance the capabilities of the mannequin considerably.
The AI agency was co-founded by Lin Qiao who additionally holds the seat of the CEO within the firm. After serving because the Senior Director of Engineering at Meta and dealing with AI frameworks and platforms, Qiao and her workforce based the startup in October 2022, as per her LinkedIn profile. In a dialog with TechCrunch, she defined the enterprise mannequin of Fireworks.ai, highlighting the fine-tuning service they supply. She mentioned, “It can be either off the shelf, open source models or the models we tune or the models our customer can tune by themselves. All three varieties can be served through our inference engine API.”
This places the agency in a singular place the place whereas it’s not innovating on the basis mannequin degree, it’s bridging the hole between an LLM and a business-ready product that may be deployed seamlessly. With a main give attention to constructing APIs, Fireworks.ai lets its enterprise shoppers plug and play any open-source AI mannequin in its catalogue. As per the report, the corporate additionally lets companies experiment with completely different AI fashions to decide on the one that matches their wants.
At current, the startup claims to include 89 open-source LLMs corresponding to Mixtral MoE 8x7B Instruct, Meta’s Llama 2 70B Chat, Google’s Gemma 7B Instruct, Stability AI’s Stable Diffusion XL, and extra. The AI agency affords the fashions in both serverless format that doesn’t require companies to configure {hardware} or deploy fashions, or as on-demand fashions which can be found for devoted deployments, served on reserved GPU configurations in accordance with enterprise wants.
For the on-demand format, Fireworks.ai has three fee plans — Developer, Business, and Enterprise — the place the Developer plan comes with a pay-per-usage construction and a fee restrict of 600 requests per minute, the Enterprise tier has customized pricing affords and limitless fee limits. The serverless format is billed at a per-token pricing plan the place completely different fashions, relying on whether or not they’re text-only, image-only, or multimodal, will fetch a unique worth.