Product graph
Tools directory
Filter by category, pricing posture, API availability, and tags.
Cloud AI platform · API
Amazon Bedrock
AWS managed service for invoking foundation models (Anthropic, Meta, Amazon Nova, Titan, and partners) with IAM, VPC, and data governance controls—single API surface for text, embeddings, and multimodal workloads in production.
Orchestration · API
AutoGen
AutoGen is a Microsoft Research–driven framework for building multi-agent conversations and tool-using agents with flexible conversation patterns—aimed at experimentation and production agents that coordinate LLMs, humans, and tools in complex flows.
Cloud AI platform · API
Azure OpenAI
Azure OpenAI Service delivers OpenAI models inside Microsoft Azure with private networking, regional deployment, and enterprise policy controls—so teams can use GPT-family models with the same procurement, identity, and compliance patterns as the rest of their Azure estate.
RAG · API
Chroma
Chroma is an open-source embedding database designed for managing and searching embeddings efficiently. It provides robust performance with sub-100ms latency for retrieval tasks.
Developer tool · API
Claude Code
Anthropic’s Claude Code is a terminal- and IDE-oriented coding agent that works across a repository using Claude models—designed for multi-file edits, refactors, and test-driven iteration with explicit approvals. Capabilities and pricing follow Anthropic’s published product pages; verify current limits for your workspace.
Orchestration · API
CrewAI
CrewAI is a Python framework for defining multi-agent “crews” with roles, goals, and delegated tasks—focused on readable orchestration of collaborative LLM agents for automation and research workflows.
IDE
Cursor
Cursor is an AI-native code editor (VS Code–familiar) with repo-wide context, inline edits, and agentic refactors aimed at product engineers shipping quickly. Model integrations and privacy controls evolve—verify the current product documentation for your plan and deployment mode.
Framework · API
DSPy
DSPy is a programming framework for building LM pipelines declaratively—optimizing prompts and few-shot demonstrations with compilers and metrics instead of hand-tuning every string—aimed at researchers and product teams who want systematic prompt improvement tied to eval scores.
Inference · API
Fireworks AI
Fireworks AI offers fast, serverless inference APIs for leading open and proprietary models with a focus on low-latency chat and batch workloads, plus deployment options for teams standardizing on a single inference surface for production assistants and eval harnesses.
IDE assistant · API
GitHub Copilot
GitHub Copilot provides inline completions and chat inside supported editors with GitHub-centric identity, policy, and audit hooks—aimed at organizations that want AI assistance tightly coupled to repository permissions and enterprise agreements.
Inference · API
Groq
GroqCloud offers very low-latency, high-throughput LLM inference using Groq’s LPU-style hardware, with OpenAI-compatible APIs for select open and partner models aimed at interactive and batch production workloads.
ML platform · API
Hugging Face
Hub for open models, datasets, and Spaces demos, plus Inference Endpoints, Transformers, and enterprise features for teams that train, fine-tune, or serve open-weight and partner models at scale.
agents · API
Hugging Face Transformers
AI platform and model hub for discovering, hosting, and deploying open models, datasets, and inference endpoints across NLP, vision, audio, and multimodal tasks.
Vector database · API
LanceDB
LanceDB is an embedded, serverless-friendly vector database built on the Lance columnar format—optimized for multimodal and large-scale local or object-store–backed retrieval with a small operational footprint for data science and edge-style deployments.
Orchestration · API
LangChain
Application framework for orchestrating LLM workflows, tool calling, retrieval, and agents across multiple providers in Python and TypeScript ecosystems.
Orchestration · API
LangGraph
LangGraph is a library for building stateful, cyclic agent and workflow graphs on top of LangChain—suited to multi-step tools, human-in-the-loop approvals, and durable execution patterns that go beyond linear chains.
Data framework · API
LlamaIndex
Data framework for LLM applications focused on ingestion pipelines, indexing, retrieval, and query orchestration over private and enterprise content sources.
data · API
Milvus
An open-source vector database designed for high-performance similarity search and analysis of large-scale vector data. It handles millions of vectors efficiently with a query latency of under 100ms for similarity searches.
Compute · API
Modal
Serverless compute platform for AI inference and batch workloads, offering GPU execution, scalable workers, and code-first deployment patterns for model-powered applications.
productivity · API
Ollama
Local model runtime for running and serving open LLMs on developer machines and private infrastructure, with simple pull/run workflows and API access.
Developer tool · API
OpenAI Codex
OpenAI Codex is OpenAI’s coding-agent product for autonomous and interactive software engineering tasks in local and cloud workflows (CLI/agent surfaces). Model routing, modalities, and enterprise controls evolve—follow OpenAI’s official documentation for the exact feature matrix and data handling for your plan.
IDE · API
OpenAI Playground
Provider of widely used frontier model APIs for text, vision, and audio, with strong developer tooling and broad ecosystem adoption across production AI applications.
Model gateway · API
OpenRouter
OpenRouter aggregates access to many foundation models behind one API and billing surface, letting teams route prompts across providers for cost, capability, or failover without maintaining separate SDKs and accounts for every vendor.
Vector database · API
Pinecone
Managed vector database for semantic search and RAG systems with metadata filtering, namespaces, and cloud-hosted reliability for production retrieval workloads.