Calling Hugging Face inference APIs from Azure ML

Mohamed Hussein 570 Reputation points
2024-12-26T12:05:46.5166667+00:00

Good Day,

Is that possible to call HuggingFace endpoints API (for the warm inference) without deploying the model itself from Azure ML?

Thank you

Azure Machine Learning
Azure Machine Learning
An Azure machine learning service for building and deploying models.
3,060 questions
{count} votes

1 answer

Sort by: Most helpful
  1. romungi-MSFT 48,236 Reputation points Microsoft Employee
    2024-12-27T12:12:06.8766667+00:00

    @Mohamed Hussein Different catalog models can have different pricing and subscription costs. This is usually displayed at the time of deploying the model from the model catalog page. In this case for hugging face, all models are listed to be deployed under managed compute, so the cost of deploying the model and calling the endpoint is based on how a real time endpoint is billed. So, if you have a hugging face endpoint the billing will use the credit or get billed under your pay as you go account.

    Some services are not always free and do not use credit, you will have to check the list of services here before planning to use them. I hope this clarifies.

    If this answers your query, do click Accept Answer and Yes for was this answer helpful. And, if you have any further query do let us know.

    0 comments No comments

Your answer

Answers can be marked as Accepted Answers by the question author, which helps users to know the answer solved the author's problem.