api-inference

Join the Hugging Face community

and get access to the augmented documentation experience

Collaborate on models, datasets and Spaces

Faster examples with accelerated inference

Switch between documentation themes

to get started

Details on pinned models

Pinned models

Model pinning was only supported for existing customers. It is now deprecated.

If you’re interested in having a model that you can readily deploy for inference, take a look at our Inference Endpoints solution! It is a secure production environment with dedicated and autoscaling infrastructure, and you have the flexibility to choose between CPU and GPU resources.

Another option is using Spaces

←Parallelism and batch jobs More information about the API→