All Resources

Serverless

25 resources

Serverless

AWS Lambda

Amazon Web Services

Serverless compute service that runs code in response to events and automatically manages the compute resources, ideal for building AI-powered applications without managing servers.

Serverless ComputingEvent-drivenAutomatic ScalingServerless
Serverless

Azure Functions

Microsoft

Event-driven serverless compute platform that solves complex orchestration problems, with built-in AI integrations through Azure Cognitive Services and Azure Machine Learning.

Serverless ComputingDurable FunctionsAI IntegrationServerless
Serverless

Cerebrium

Cerebrium

Serverless GPU platform for deploying custom AI models. Supports any Python-based ML framework with cold starts under 5 seconds.

GPU CloudDeploymentMLOpsGPU
Serverless

Chroma

Chroma

The AI-native open-source embedding database with cloud-hosted serverless options for production deployments.

Vector SearchServerlessOpen SourceVector Database
Serverless

Cloudflare Workers AI

Cloudflare

Run AI inference at the edge across Cloudflare's global network. Serverless, pay-per-use access to curated open-source models with ultra-low latency.

Edge AIInferenceGlobalEdge
Serverless

Cohere API

Cohere

Enterprise NLP API specializing in embeddings, reranking, and RAG. Command R models are optimized for enterprise search and agentic workflows.

EmbeddingsRerankingEnterpriseRAG
Serverless

Cohere Serverless API

Cohere

Production-ready AI models via serverless API, including embeddings, text generation, and classification functions.

EmbeddingsText GenerationServerless APIAI Models
Serverless

Databricks Serverless AI

Databricks

End-to-end serverless ML platform that enables building, training, and deploying ML models with automatic infrastructure management and MLOps capabilities.

Serverless MLAutoMLModel ServingML Platform
Serverless

Deno Deploy

Deno

Serverless platform for JavaScript and TypeScript with built-in AI capabilities, featuring secure runtime and global distribution for deploying AI applications with low latency.

Edge ComputingTypeScript NativeServerlessTypeScript
Serverless

Fireworks AI

Fireworks AI

Production-grade generative AI platform offering fast inference for open-source LLMs and multimodal models with serverless and dedicated deployments.

InferenceLLMMultimodalServerless
Serverless

Groq API

Groq

Ultra-fast LLM inference API powered by LPU hardware, offering some of the fastest token generation speeds available for models like Llama 3 and Mixtral.

InferenceLLMAPIGroq
Serverless

Hugging Face Inference API

Hugging Face

Run thousands of open-source models via a simple API. No setup required — instantly access text, image, audio, and video models hosted by Hugging Face.

InferenceAPIOpen SourceServerless
Serverless

Langbase 2024

Langbase

A serverless vector database for AI applications with automatic embedding generation and cost-effective storage solutions.

Vector SearchServerlessEmbeddingsVector Database
Serverless

Lepton AI

Lepton AI

Pythonic serverless platform for running and scaling AI workloads. Define your model as a Python class and deploy to GPUs in minutes with built-in autoscaling.

GPU CloudPythonAutoscalingGPU
Serverless

Milvus Cloud

Zilliz

Scalable, cloud-native vector database supporting trillion-scale data with managed serverless deployments.

Vector SearchServerlessScalableVector Database
Serverless

Modal

Modal Labs

Cloud platform for running AI/ML workloads. Write Python functions, deploy instantly, and scale to thousands of GPUs with no infrastructure management.

GPU CloudMLOpsPythonGPU
Serverless

OpenRouter

OpenRouter

Unified API gateway for 100+ LLMs from OpenAI, Anthropic, Google, Meta, and more. Single integration, automatic fallbacks, and cost-based routing.

API GatewayMulti-modelRoutingAPI
Serverless

Pinecone

Pinecone Systems

Managed vector database designed for machine learning applications, with real-time vector search and serverless operations.

Vector SearchServerlessReal-timeVector Database
Serverless

Qdrant Cloud

Qdrant

Vector database for similarity search with a focus on extended filtering support and serverless operations.

Vector SearchServerlessFilteringVector Database
Serverless

Replicate

Replicate

Run open-source machine learning models with a cloud API. Deploy and scale models instantly without managing infrastructure.

InferenceAPIMLOpsServerless
Serverless

Supabase Edge Functions

Supabase

Serverless functions for building AI applications with Postgres integration, perfect for building LLM-powered apps with vector search capabilities.

Edge ComputingPostgres IntegrationServerlessPostgres
Serverless

Together AI

Together AI

Serverless and dedicated AI inference platform with access to 100+ open-source models. Features fast inference and fine-tuning capabilities.

InferenceFine-tuningAPIServerless
Serverless

Upstash Vector

Upstash

Serverless vector database optimized for AI applications with pay-per-request pricing model.

Vector SearchServerlessPay-per-requestVector Database
Serverless

Vercel AI SDK

Vercel

Library for building AI-powered user interfaces with React Server Components and streaming responses from AI providers like OpenAI, Anthropic, and more.

AI StreamingReact IntegrationServerless DeploymentAI SDK
Serverless

Weaviate Cloud

SeMI Technologies

A cloud-native, serverless vector search service with semantic search capabilities and GraphQL API.

Vector SearchServerlessGraphQLVector Database