ai_gen function

Applies to: check marked yes Databricks SQL check marked yes Databricks Runtime

Important

This feature is in Public Preview.

In the preview:

  • The underlying language model can handle several languages, however these functions are tuned for English.
  • There is rate limiting for the underlying Foundation Model APIs. See Foundation Model APIs limits to update these limits.
  • Due to rate limiting, this function is designed for testing on small datasets that have less than 100 rows. For use cases with over 100 rows of data, Databricks recommends using ai_query and a provisioned throughput endpoint. See Perform batch LLM inference using ai_query.

The ai_gen() function invokes a state-of-the-art generative AI model to answer the user-provided prompt using SQL. This function uses a chat model serving endpoint made available by Databricks Foundation Model APIs.

Requirements

Important

The underlying models that might be used at this time are licensed under the Apache 2.0 License, Copyright © The Apache Software Foundation or the LLAMA 3.1 Community License Copyright © Meta Platforms, Inc. All rights reserved. Customers are responsible for ensuring compliance with applicable model licenses.

Databricks recommends reviewing these licenses to ensure compliance with any applicable terms. If models emerge in the future that perform better according to Databricks’s internal benchmarks, Databricks might change the model (and the list of applicable licenses provided on this page).

Currently, Meta-Llama-3.1-70B-Instruct is the underlying model that powers these AI functions.

Note

In Databricks Runtime 15.1 and above, this function is supported in Databricks notebooks, including notebooks that are run as a task in a Databricks workflow.

Syntax

ai_gen(prompt)

Arguments

  • prompt: A STRING expression.

Returns

A STRING.

Examples

> SELECT ai_gen('Generate a concise, cheerful email title for a summer bike sale with 20% discount');
  Summer Bike Sale: Grab Your Dream Bike at 20% Off!

> SELECT
    question,
    ai_gen(
      'You are a teacher. Answer the students question in 50 words: ' || question
    ) AS answer
  FROM
    questions
  LIMIT 10;