Workers-AI

16 posts

Birthday-Week Partners Developer-Platform Workers-Launchpad Workers-AI Cloudflare-Workers Turnstile Performance Security Cache Speed Speed-Brain Developers AI

Wrapping up another Birthday Week celebration

Recapping all the big announcements made during 2024’s Birthday Week.

Kelly May Johnston 9/30/2024

Cloudflare

Birthday-Week Vectorize AI-Gateway AI Developers Developer-Platform Workers-AI

Cloudflare’s bigger, better, faster AI platform

Whether you want the fastest inference at the edge, optimized AI workflows, or vector database-powered RAG, we’re excited to help you harness the full potential of AI and get started on building with Cloudflare.

Michelle Chen 9/26/2024

Workers-AI AI Product-News Developer-Platform Developers Open-Source

Meta Llama 3.1 now available on Workers AI

Cloudflare is excited to be a launch partner with Meta, introducing Workers AI support for Llama 3.1.

Michelle Chen 7/23/2024

Product-News Workers-AI Developer-Platform Developers Open-Source AI Internship-Experience

Embedded function calling in Workers AI: easier, smarter, faster

Introducing a new way to do function calling in Workers AI by running function code alongside your inference. Plus, a new @cloudflare/ai-utils package to make getting started as simple as possible.

Harley Turan 6/27/2024

Developer-Platform Developers Workers-AI AI Product-News Cloudflare-Stream

Introducing Stream Generated Captions, powered by Workers AI

With one click, users can now generate video captions using Stream’s newest feature: AI-generated captions for on-demand videos and recordings of live streams.

Mickie Betz 6/20/2024

Developer-Platform Developers Open-Source Workers-AI AI-Gateway AI

AI Gateway is generally available: a unified interface for managing and scaling your generative AI workloads

AI Gateway is an AI ops platform that provides speed, reliability, and observability for your AI applications. With a single line of code, you can unlock powerful features including rate limiting, custom caching, real-time logs, and aggregated analytics across multiple providers

Kathy Liao 5/22/2024

Llama Developers Developer-Platform Workers-AI Cloudflare-Workers Product-News

Meta Llama 3 available on Cloudflare Workers AI

We are thrilled to give developers around the world the ability to build AI applications with Meta Llama 3 using Workers AI. We are proud to be a launch partner with Meta for their newest 8B Llama 3 model

Michelle Chen 4/18/2024

Developer-Week Developers Workers-AI General-Availability Developer-Platform Cloudflare-Workers

Leveling up Workers AI: general availability and more new capabilities

Today, we’re excited to make a series of announcements: Workers AI, Cloudflare’s inference platform, is now GA, with support for fine-tuned models with LoRAs and one-click deploys from Hugging Face. Cloudflare Workers now supports the Python programming language, and more.

Michelle Chen 4/2/2024

Developers Developer-Week Workers-AI AI Cloudflare-Workers Developer-Platform

Running fine-tuned models on Workers AI with LoRAs

Workers AI now supports fine-tuned models using LoRAs. But what is a LoRA and how does it work? In this post, we dive into fine-tuning, LoRAs, and even some math to share the details of how it all works under the hood.

Michelle Chen 4/2/2024

Bug-Bounty LLM Vulnerabilities Developer-Platform Workers-AI AI-Gateway SASE

Mitigating a token-length side-channel attack in our AI products

The Workers AI and AI Gateway team recently collaborated closely with security researchers at Ben-Gurion University on a report submitted through our Public Bug Bounty program. Through this process, we discovered and fully patched a vulnerability affecting all LLM providers. Here’s the story.

Celso Martinho 3/14/2024

Workers-AI Product-News

Unlocking new use cases with 17 new models in Workers AI, including new LLMs, image generation models, and more

On February 6, 2024, we announced eight new models that we added to our catalog for text generation, classification, and code generation use cases. Today, we’re back with seventeen (17!) more models, focused on enabling new types of tasks and use cases with Workers AI.

Michelle Chen 2/28/2024

Workers-AI Cloudflare-Workers AI Open-Source

Adding new LLMs, text classification and code generation models to the Workers AI catalog

Workers AI is now bigger and better, with 8 new models and improved model performance.

Michelle Chen (http://blog.cloudflare.com/author/michelle/) 2/6/2024

AI Workers-AI Developer-Platform Deep-Dive

How we used OpenBMC to support AI inference on GPUs around the world

Here is what Cloudflare has been able to do so far with OpenBMC on our GPU-equipped servers.

Ryan Chow (http://blog.cloudflare.com/author/ryan-chow/) 12/6/2023

Workers-AI Cloudflare-Workers

Workers AI Update: Stable Diffusion, Code Llama + Workers AI in 100 cities

We're thrilled to announce that Stable Diffusion and Code Llama are now available as part of Workers AI, running in over 100 cities across Cloudflare’s global network.

Phil Wittig (http://blog.cloudflare.com/author/phil/) 11/23/2023

Workers-AI Mistral Cloudflare-Workers

Workers AI Update: Hello Mistral 7B

Today we’re excited to announce that we’ve added Mistral-7B-v0.1-instruct to Workers AI.

Jesse Kipp (http://blog.cloudflare.com/author/jesse/) 11/21/2023

Workers-AI Cloudflare-Workers Developer-Platform JavaScript Serverless

Streaming and longer context lengths for LLMs on Workers AI

Workers AI now supports streaming text responses for the LLMs in our catalog, including Llama-2, using server-sent events.

Jesse Kipp (http://blog.cloudflare.com/author/jesse/) 11/14/2023