GenAIWiki


Tools directory

Filter by category, pricing posture, API availability, and tags.

Cloud AI platform · API

Amazon Bedrock

AWS managed service for invoking foundation models (Anthropic, Meta, Amazon Nova, Titan, and partners) with IAM, VPC, and data governance controls—single API surface for text, embeddings, and multimodal workloads in production.

Featured · Updated 6 weeks ago
aws · enterprise · api

Orchestration · API

AutoGen

AutoGen is a Microsoft Research–driven framework for building multi-agent conversations and tool-using agents with flexible conversation patterns—aimed at experimentation and production agents that coordinate LLMs, humans, and tools in complex flows.

Updated 6 weeks ago
agents · multi-agent · python

Cloud AI platform · API

Azure OpenAI

Azure OpenAI Service delivers OpenAI models inside Microsoft Azure with private networking, regional deployment, and enterprise policy controls—so teams can use GPT-family models with the same procurement, identity, and compliance patterns as the rest of their Azure estate.

Featured · Updated 6 weeks ago
azure · enterprise · api

RAG · API

Chroma

Chroma is an open-source embedding database for storing and searching embeddings. Retrieval latency is cited as under 100 ms, though actual figures depend on collection size and hardware.

Featured · Updated 6 weeks ago
RAG · Embedding Database · Search

Developer tool · API

Claude Code

Anthropic’s Claude Code is a terminal- and IDE-oriented coding agent that works across a repository using Claude models—designed for multi-file edits, refactors, and test-driven iteration with explicit approvals. Capabilities and pricing follow Anthropic’s published product pages; verify current limits for your workspace.

Featured · Updated 3 weeks ago
coding · agents · cli

Orchestration · API

CrewAI

CrewAI is a Python framework for defining multi-agent “crews” with roles, goals, and delegated tasks—focused on readable orchestration of collaborative LLM agents for automation and research workflows.

Featured · Updated 6 weeks ago
agents · multi-agent · python

IDE

Cursor

Cursor is an AI-native code editor (VS Code–familiar) with repo-wide context, inline edits, and agentic refactors aimed at product engineers shipping quickly. Model integrations and privacy controls evolve—verify the current product documentation for your plan and deployment mode.

Featured · Updated 3 weeks ago
coding · agents · editor

Framework · API

DSPy

DSPy is a programming framework for building LM pipelines declaratively—optimizing prompts and few-shot demonstrations with compilers and metrics instead of hand-tuning every string—aimed at researchers and product teams who want systematic prompt improvement tied to eval scores.

Updated 6 weeks ago
prompting · optimization · evals

Inference · API

Fireworks AI

Fireworks AI offers fast, serverless inference APIs for leading open and proprietary models with a focus on low-latency chat and batch workloads, plus deployment options for teams standardizing on a single inference surface for production assistants and eval harnesses.

Featured · Updated 6 weeks ago
inference · api · serverless

IDE assistant · API

GitHub Copilot

GitHub Copilot provides inline completions and chat inside supported editors with GitHub-centric identity, policy, and audit hooks—aimed at organizations that want AI assistance tightly coupled to repository permissions and enterprise agreements.

Featured · Updated 3 weeks ago
coding · enterprise · vscode

Inference · API

Groq

GroqCloud offers very low-latency, high-throughput LLM inference on Groq’s LPU (Language Processing Unit) hardware, with OpenAI-compatible APIs for select open and partner models aimed at interactive and batch production workloads.

Featured · Updated 6 weeks ago
inference · latency · api

ML platform · API

Hugging Face

Hub for open models, datasets, and Spaces demos, plus Inference Endpoints, Transformers, and enterprise features for teams that train, fine-tune, or serve open-weight and partner models at scale.

Featured · Updated 6 weeks ago
open-models · hub · training

agents · API

Hugging Face Transformers

Open-source Python library for loading, fine-tuning, and running pretrained models across NLP, vision, audio, and multimodal tasks, integrated with the Hugging Face model hub for discovering and sharing open models.

Updated 6 weeks ago
model hub · inference · open models

Vector database · API

LanceDB

LanceDB is an embedded, serverless-friendly vector database built on the Lance columnar format—optimized for multimodal and large-scale local or object-store–backed retrieval with a small operational footprint for data science and edge-style deployments.

Updated 6 weeks ago
vectors · embedded · columnar

Orchestration · API

LangChain

Application framework for orchestrating LLM workflows, tool calling, retrieval, and agents across multiple providers in Python and TypeScript ecosystems.

Featured · Updated 6 weeks ago
orchestration · agents · RAG

Orchestration · API

LangGraph

LangGraph is a library for building stateful, cyclic agent and workflow graphs on top of LangChain—suited to multi-step tools, human-in-the-loop approvals, and durable execution patterns that go beyond linear chains.

Featured · Updated 3 weeks ago
agents · graphs · orchestration

Data framework · API

LlamaIndex

Data framework for LLM applications focused on ingestion pipelines, indexing, retrieval, and query orchestration over private and enterprise content sources.

Featured · Updated 6 weeks ago
RAG · indexing · retrieval

data · API

Milvus

Milvus is an open-source vector database designed for high-performance similarity search and analysis of large-scale vector data. It is built to handle millions of vectors, with similarity-search latency cited as under 100 ms; actual figures depend on index type, dataset size, and hardware.

Featured · Updated 6 weeks ago
data · vector · database

Compute · API

Modal

Serverless compute platform for AI inference and batch workloads, offering GPU execution, scalable workers, and code-first deployment patterns for model-powered applications.

Updated 6 weeks ago
serverless · GPU · deployment

productivity · API

Ollama

Local model runtime for running and serving open LLMs on developer machines and private infrastructure, with simple pull/run workflows and API access.

Featured · Updated 6 weeks ago
local models · inference · self-hosting

Developer tool · API

OpenAI Codex

OpenAI Codex is OpenAI’s coding-agent product for autonomous and interactive software engineering tasks in local and cloud workflows (CLI/agent surfaces). Model routing, modalities, and enterprise controls evolve—follow OpenAI’s official documentation for the exact feature matrix and data handling for your plan.

Featured · Updated 3 weeks ago
coding · agents · openai

IDE · API

OpenAI Playground

Web-based interface from OpenAI, the provider of widely used frontier model APIs for text, vision, and audio, for trying prompts and model settings interactively before moving them into production AI applications.

Featured · Updated 6 weeks ago
LLM · API · multimodal

Model gateway · API

OpenRouter

OpenRouter aggregates access to many foundation models behind one API and billing surface, letting teams route prompts across providers for cost, capability, or failover without maintaining separate SDKs and accounts for every vendor.

Featured · Updated 6 weeks ago
routing · api · multi-provider

Vector database · API

Pinecone

Managed vector database for semantic search and RAG systems with metadata filtering, namespaces, and cloud-hosted reliability for production retrieval workloads.

Featured · Updated 6 weeks ago
vector database · RAG · semantic search
