Workers AI, Workers Launchpad

September 26, 2024 1:00 PM

Cloudflare’s bigger, better, faster AI platform

Cloudflare helps you build AI applications with fast inference at the edge, optimized AI workflows, and vector database-powered RAG solutions...

Birthday Week, Vectorize, AI Gateway, AI

July 23, 2024 3:15 PM

Meta Llama 3.1 now available on Workers AI

Cloudflare is excited to be a launch partner with Meta to introduce Workers AI support for Llama 3.1...

Michelle Chen, Nikhil Kothari

AI

June 27, 2024 5:00 PM

Embedded function calling in Workers AI: easier, smarter, faster

Introducing a new way to do function calling in Workers AI by running function code alongside your inference. Plus, a new @cloudflare/ai-utils package to make getting started as simple as possible...

Product News, Open Source

June 20, 2024 1:00 PM

Introducing Stream Generated Captions, powered by Workers AI

With one click, users can now generate video captions effortlessly using Stream’s newest feature: AI-generated captions for on-demand videos and recordings of live streams...

Developer Platform, AI

May 22, 2024 1:00 PM

AI Gateway is generally available: a unified interface for managing and scaling your generative AI workloads

AI Gateway is an AI ops platform that provides speed, reliability, and observability for your AI applications. With a single line of code, you can unlock powerful features including rate limiting, custom caching, real-time logs, and aggregated analytics across multiple providers...

Developer Platform, Open Source, Connectivity Cloud

April 18, 2024 8:58 PM

Meta Llama 3 available on Cloudflare Workers AI

We are thrilled to give developers around the world the ability to build AI applications with Meta Llama 3 using Workers AI. We are proud to be a launch partner with Meta for their newest 8B Llama 3 model...

Llama

April 02, 2024 1:01 PM

Leveling up Workers AI: general availability and more new capabilities

Today, we’re excited to make a series of announcements, including Workers AI, Cloudflare’s inference platform, becoming GA, as well as support for fine-tuned models with LoRAs and one-click deploys from Hugging Face. Cloudflare Workers now supports the Python programming language, and more...

Developer Week, General Availability

April 02, 2024 1:00 PM

Running fine-tuned models on Workers AI with LoRAs

Workers AI now supports fine-tuned models using LoRAs. But what is a LoRA and how does it work? In this post, we dive into fine-tuning, LoRAs and even some math to share the details of how it all works under the hood...

Michelle Chen, Logan Grasby

Developers, Developer Week, AI

March 14, 2024 12:30 PM

Mitigating a token-length side-channel attack in our AI products

The Workers AI and AI Gateway team recently collaborated closely with security researchers at Ben Gurion University regarding a report submitted through our Public Bug Bounty program. Through this process, we discovered and fully patched a vulnerability affecting all LLM provider...

Celso Martinho, Michelle Chen

Bug Bounty, LLM, Vulnerabilities

March 04, 2024 2:00 PM

Cloudflare launches AI Assistant for Security Analytics

Introducing AI Assistant for Security Analytics. Now it is easier than ever to get powerful insights about your web security. Use the new integrated natural language query interface to explore Security Analytics...

Jen Sells, Harley Turan

Security Week, Security, WAF

February 28, 2024 8:00 PM

Unlocking new use cases with 17 new models in Workers AI, including new LLMs, image generation models, and more

In February 2024 we added 8 models for text generation, classification, and code generation use cases. Today, we’re back with 17 more models, focused on enabling new types of tasks and use cases...

Michelle Chen, Logan Grasby

February 06, 2024 8:00 PM

Adding new LLMs, text classification and code generation models to the Workers AI catalog

Workers AI is now bigger and better with 8 new models and improved model performance...

Michelle Chen, Logan Grasby

AI, Open Source

December 06, 2023 2:00 PM

How we used OpenBMC to support AI inference on GPUs around the world

This is what Cloudflare has been able to do so far with OpenBMC with respect to our GPU-equipped servers...

Deep Dive

November 23, 2023 2:00 PM

Workers AI Update: Stable Diffusion, Code Llama + Workers AI in 100 cities

We're thrilled to announce that Stable Diffusion and Code Llama are now available as part of Workers AI, running in over 100 cities across Cloudflare’s global network...

Phil Wittig