AI Platforms Directory
31 AI infrastructure and deployment platforms ranked by composite score — covering cloud AI, MLOps, model serving, vector databases, and enterprise AI solutions. Each platform is scored on adoption, quality, freshness, citations, and community engagement.
31 platforms
TensorFlow
by Google
TensorFlow is an open-source machine learning platform developed by Google. It provides a comprehensive ecosystem of tools, libraries, and community resources that lets researchers push the state-of-the-art in ML and developers easily build and deploy ML-powered applications.
PyTorch
by Meta AI
PyTorch is an open-source machine learning framework based on the Torch library, used for applications such as computer vision and natural language processing. It is primarily developed by Meta AI and is known for its dynamic computation graph and ease of use.
Meta AI Llama 3
by Meta AI
Meta AI Llama 3 is a family of open-source large language models (LLMs) released by Meta AI. It is designed for a wide range of natural language processing tasks, including text generation, translation, and question answering, and is intended to be a powerful and accessible tool for researchers and developers.
Hugging Face Transformers
by Hugging Face
Hugging Face Transformers is a popular open-source library providing pre-trained models and tools for natural language processing (NLP). It simplifies the process of using and fine-tuning state-of-the-art transformer models for various NLP tasks.
Weights & Biases (W&B)
by Weights & Biases
Weights & Biases (W&B) is a comprehensive MLOps platform for tracking, visualizing, and collaborating on machine learning experiments. It provides tools for experiment tracking, hyperparameter optimization, model versioning, and collaboration, aiming to streamline the ML development lifecycle.
Modal
by Modal Labs
Modal is a serverless platform designed for running AI/ML workloads in the cloud. It simplifies the deployment and scaling of applications, allowing developers to focus on code rather than infrastructure management. Modal supports a variety of use cases, including model training, inference, and data processing.
MLflow
by Databricks
MLflow is an open-source platform to manage the ML lifecycle, including experimentation, reproducibility, deployment, and a central model registry. It allows tracking experiments, packaging code into reproducible runs, and deploying models to various platforms.
Together AI Platform
by Together AI
Together AI Platform is a cloud platform that provides tools and infrastructure for training, fine-tuning, and deploying AI models. It focuses on making AI accessible and cost-effective, offering a range of open-source models and optimized hardware.
CoreWeave
by CoreWeave, Inc.
CoreWeave is a specialized cloud provider focusing on compute-intensive workloads like AI/ML, rendering, and visual effects. They offer a range of GPU-optimized virtual machines and Kubernetes clusters designed for demanding applications.
JAX
by Google Research
JAX is a numerical computation library developed by Google Research that combines Autograd and XLA to provide high-performance machine learning research. It offers automatic differentiation, GPU/TPU acceleration, and composable function transformations.
GroqCloud
by Groq
GroqCloud is a cloud service providing access to Groq's Tensor Streaming Architecture (TSA) for ultra-low latency inference. It's designed for applications requiring real-time responses from large language models and other AI models.
OctoAI Inference Service
by OctoML
OctoAI Inference Service is a fully managed platform for deploying and scaling AI models. It provides optimized hardware and software stacks to accelerate inference performance and reduce costs, supporting a wide range of models and frameworks.
Fireworks AI
by Fireworks AI, Inc.
Fireworks AI is a platform designed for deploying and scaling AI models, particularly large language models (LLMs). It offers a serverless inference solution with a focus on low latency and cost-effectiveness, enabling developers to easily integrate AI into their applications.
RunPod
by RunPod, Inc.
RunPod is a cloud platform specializing in GPU compute for AI/ML workloads. It offers both on-demand and reserved instances of various GPU types, allowing users to run training, inference, and other compute-intensive tasks at competitive prices.
Fal AI
by Fal AI, Inc.
Fal AI is a serverless platform designed for deploying and scaling AI applications, particularly those involving GPUs. It simplifies the process of deploying machine learning models and provides tools for building AI-powered applications with ease.
DeepInfra
by DeepInfra, Inc.
DeepInfra is a serverless inference platform that allows users to deploy and scale AI models with ease. It focuses on providing a simple and efficient way to serve models without managing infrastructure, supporting various frameworks and model types.
Baseten
by Baseten Labs
Baseten is a serverless platform designed for deploying and scaling machine learning models. It simplifies the process of turning models into production-ready APIs, offering features like autoscaling, monitoring, and custom container support.
Novita AI
by Novita AI
Novita AI is an AI art generation platform that provides a unified API for accessing multiple image generation models. It simplifies the process of creating AI art by offering a single endpoint for various models, enabling users to easily experiment with different styles and techniques.
Cerebrium AI
by Cerebrium, Inc.
Cerebrium AI is a platform designed for deploying and scaling machine learning models. It provides tools for model serving, monitoring, and management, aiming to simplify the deployment process for data scientists and ML engineers.
Replicate
by Replicate AI
A platform that allows developers to run open-source machine learning models via a simple API. It acts as a marketplace and hosting service for a wide variety of models, abstracting away the complexities of GPU infrastructure and deployment.
Portkey AI
by Portkey AI
Portkey AI acts as an AI gateway, providing a unified API layer for managing LLM interactions. It offers features like routing, caching, rate limiting, and guardrails to enhance reliability, performance, and control over LLM applications.
Modal
by Modal Labs
Modal provides a serverless compute platform optimized for machine learning workloads, allowing developers to run GPU-accelerated functions and applications without managing infrastructure. It simplifies the deployment and scaling of ML models and data pipelines.
Lambda Labs
by Lambda Labs
Lambda Labs provides a specialized GPU cloud for deep learning, offering high-performance GPUs and pre-configured environments for AI development and deployment. It caters to researchers and engineers needing powerful compute for complex AI workloads.
HuggingFace Spaces
by HuggingFace
A platform for building, hosting, and sharing interactive machine learning demo applications. It supports popular frameworks like Gradio and Streamlit, allowing users to showcase their models in an accessible web interface directly from the HuggingFace Hub.
Missing a platform?
Submit any AI platform to the index. Our research pipeline scores and enriches it automatically.
Submit a Platform