Deploy MCP Server
AI/ML Bearer Token

Replicate REST API

Run and deploy AI models via REST API in seconds

Replicate is a cloud platform that lets developers run machine learning models through a simple REST API without managing infrastructure. Deploy open-source models or custom ML models with automatic scaling, pay-per-use pricing, and production-ready inference endpoints. Used by developers to integrate AI capabilities like image generation, language models, speech synthesis, and more into applications.

Base URL https://api.replicate.com/v1

API Endpoints

MethodEndpointDescription
GET/modelsList all public models available on Replicate
GET/models/{model_owner}/{model_name}Get details about a specific model including versions and schema
GET/models/{model_owner}/{model_name}/versionsList all versions of a specific model
GET/models/{model_owner}/{model_name}/versions/{version_id}Get details about a specific model version
POST/predictionsCreate a new prediction (run a model) and receive results
GET/predictions/{prediction_id}Get the status and output of a prediction
POST/predictions/{prediction_id}/cancelCancel an in-progress prediction
GET/predictionsList all predictions for your account
POST/deployments/{deployment_owner}/{deployment_name}/predictionsCreate a prediction using a private deployment
GET/deploymentsList all deployments in your account
GET/deployments/{deployment_owner}/{deployment_name}Get details about a specific deployment
POST/trainingsStart a model training job with custom data
GET/trainings/{training_id}Get the status and results of a training job
POST/trainings/{training_id}/cancelCancel an in-progress training job
GET/collections/{collection_slug}Get a curated collection of models by category

Code Examples

curl -s -X POST \
  https://api.replicate.com/v1/predictions \
  -H "Authorization: Bearer $REPLICATE_API_TOKEN" \
  -H "Content-Type: application/json" \
  -d '{
    "version": "stability-ai/sdxl:39ed52f2a78e934b3ba6e2a89f5b1c712de7dfea535525255b1aa35c5565e08b",
    "input": {
      "prompt": "A serene lake at sunset with mountains",
      "num_inference_steps": 50
    }
  }'

Connect Replicate to AI

Deploy a Replicate MCP server on IOX Cloud and connect it to Claude, ChatGPT, Cursor, or any AI client. Your AI assistant gets direct access to Replicate through these tools:

run_replicate_model Execute any Replicate model with specified inputs and return the prediction results, supporting image generation, text generation, audio synthesis, and other AI tasks
get_prediction_status Check the status and retrieve outputs of a running or completed prediction by ID, useful for monitoring long-running model executions
search_models Search and filter available Replicate models by category, task type, or keywords to find appropriate models for specific AI use cases
train_custom_model Start a training job to fine-tune models with custom datasets, enabling personalized AI models for specific domains or styles
list_model_versions Retrieve all available versions of a model to compare capabilities, performance, and select the optimal version for a task

Deploy in 60 seconds

Describe what you need, AI generates the code, and IOX deploys it globally.

Deploy Replicate MCP Server →

Related APIs