Serverless

25 resources

Serverless

AWS Lambda

Amazon Web Services

Serverless compute service that runs code in response to events and automatically manages the compute resources, ideal for building AI-powered applications without managing servers.

Serverless ComputingEvent-drivenAutomatic ScalingServerless

Website Docs

Serverless

Azure Functions

Microsoft

Event-driven serverless compute platform that solves complex orchestration problems, with built-in AI integrations through Azure Cognitive Services and Azure Machine Learning.

Serverless ComputingDurable FunctionsAI IntegrationServerless

Website Docs

Serverless

Cerebrium

Serverless GPU platform for deploying custom AI models. Supports any Python-based ML framework with cold starts under 5 seconds.

GPU CloudDeploymentMLOpsGPU

Website Docs

Serverless

Chroma

The AI-native open-source embedding database with cloud-hosted serverless options for production deployments.

Vector SearchServerlessOpen SourceVector Database

Website Docs

Serverless

Cloudflare Workers AI

Cloudflare

Run AI inference at the edge across Cloudflare's global network. Serverless, pay-per-use access to curated open-source models with ultra-low latency.

Edge AIInferenceGlobalEdge

Website Docs

Serverless

Cohere API

Cohere

Enterprise NLP API specializing in embeddings, reranking, and RAG. Command R models are optimized for enterprise search and agentic workflows.

EmbeddingsRerankingEnterpriseRAG

Website Docs

Serverless

Cohere Serverless API

Cohere

Production-ready AI models via serverless API, including embeddings, text generation, and classification functions.

EmbeddingsText GenerationServerless APIAI Models

Website Docs

Serverless

Databricks Serverless AI

Databricks

End-to-end serverless ML platform that enables building, training, and deploying ML models with automatic infrastructure management and MLOps capabilities.

Serverless MLAutoMLModel ServingML Platform

Website Docs

Serverless

Deno Deploy

Deno

Serverless platform for JavaScript and TypeScript with built-in AI capabilities, featuring secure runtime and global distribution for deploying AI applications with low latency.

Edge ComputingTypeScript NativeServerlessTypeScript

Website Docs

Serverless

Fireworks AI

Production-grade generative AI platform offering fast inference for open-source LLMs and multimodal models with serverless and dedicated deployments.

InferenceLLMMultimodalServerless

Website Docs

Serverless

Groq API

Groq

Ultra-fast LLM inference API powered by LPU hardware, offering some of the fastest token generation speeds available for models like Llama 3 and Mixtral.

InferenceLLMAPIGroq

Website Docs

Serverless

Hugging Face Inference API

Hugging Face

Run thousands of open-source models via a simple API. No setup required — instantly access text, image, audio, and video models hosted by Hugging Face.

InferenceAPIOpen SourceServerless

Website Docs

Serverless

Langbase 2024

Langbase

A serverless vector database for AI applications with automatic embedding generation and cost-effective storage solutions.

Vector SearchServerlessEmbeddingsVector Database

Website Docs

Serverless

Lepton AI

Pythonic serverless platform for running and scaling AI workloads. Define your model as a Python class and deploy to GPUs in minutes with built-in autoscaling.

GPU CloudPythonAutoscalingGPU

Website Docs

Serverless

Milvus Cloud

Zilliz

Scalable, cloud-native vector database supporting trillion-scale data with managed serverless deployments.

Vector SearchServerlessScalableVector Database

Website Docs

Serverless

Modal

Modal Labs

Cloud platform for running AI/ML workloads. Write Python functions, deploy instantly, and scale to thousands of GPUs with no infrastructure management.

GPU CloudMLOpsPythonGPU

Website GitHub Docs

Serverless

OpenRouter

Unified API gateway for 100+ LLMs from OpenAI, Anthropic, Google, Meta, and more. Single integration, automatic fallbacks, and cost-based routing.

API GatewayMulti-modelRoutingAPI

Website Docs

Serverless

Pinecone

Pinecone Systems

Managed vector database designed for machine learning applications, with real-time vector search and serverless operations.

Vector SearchServerlessReal-timeVector Database

Website Docs

Serverless

Qdrant Cloud

Qdrant

Vector database for similarity search with a focus on extended filtering support and serverless operations.

Vector SearchServerlessFilteringVector Database

Website Docs

Serverless

Replicate

Run open-source machine learning models with a cloud API. Deploy and scale models instantly without managing infrastructure.

InferenceAPIMLOpsServerless

Website Docs

Serverless

Supabase Edge Functions

Supabase

Serverless functions for building AI applications with Postgres integration, perfect for building LLM-powered apps with vector search capabilities.

Edge ComputingPostgres IntegrationServerlessPostgres

Website Docs

Serverless

Together AI

Serverless and dedicated AI inference platform with access to 100+ open-source models. Features fast inference and fine-tuning capabilities.

InferenceFine-tuningAPIServerless

Website Docs

Serverless

Upstash Vector

Upstash

Serverless vector database optimized for AI applications with pay-per-request pricing model.

Vector SearchServerlessPay-per-requestVector Database

Website Docs

Serverless

Vercel AI SDK

Vercel

Library for building AI-powered user interfaces with React Server Components and streaming responses from AI providers like OpenAI, Anthropic, and more.

AI StreamingReact IntegrationServerless DeploymentAI SDK

Website Docs

Serverless

Weaviate Cloud

SeMI Technologies

A cloud-native, serverless vector search service with semantic search capabilities and GraphQL API.

Vector SearchServerlessGraphQLVector Database

Website Docs