# How Will Generative AI Affect Open Data?

Conversational generative AI such as ChatGPT is changing the data use landscape. Although most conversational agents are not optimised for finding and retrieving datasets, they do hold some promise for supporting [users to make sense of datasets](https://theodi.org/news-and-events/blog/the-promise-and-challenge-of-data-discovery-with-llms/).&#x20;

Generative AI can also support automatic data-to-text approaches for [data summarisation](/user-centric-open-data-publishing/publication/dataset-summarisation.md). Data-to-text generation can produce summarisation sentences, which could then be edited by the publisher.

Croissant is a metadata format to help standardize machine learning (ML) datasets. The aim of Croissant is to make ML datasets easily discoverable and usable across tools and platforms. Croissant is easy to adopt because it doesn’t require changing the data itself or how it is represented. Instead, it adds a layer of metadata that represents the contents of the dataset in a standardized way, describing key attributes and properties.

We discuss the particular issues of opening data that is intended for machine learning uses in [Data Quality](/user-centric-open-data-publishing/preparation/data-quality-issues-for-machine-learning.md)[.](/user-centric-open-data-publishing/preparation/data-quality-issues-for-machine-learning.md) &#x20;

[Find out more about how to make your machine learning datasets discoverable and usable with Croissant](https://mlcommons.org/2024/03/croissant_metadata_announce/)


---

# Agent Instructions: Querying This Documentation

If you need additional information that is not directly available in this page, you can query the documentation dynamically by asking a question.

Perform an HTTP GET request on the current page URL with the `ask` query parameter:

```
GET https://user-centric-open-data-publishin.gitbook.io/user-centric-open-data-publishing/thinking-about-users/publish-your-docs/how-will-generative-ai-affect-open-data.md?ask=<question>
```

The question should be specific, self-contained, and written in natural language.
The response will contain a direct answer to the question and relevant excerpts and sources from the documentation.

Use this mechanism when the answer is not explicitly present in the current page, you need clarification or additional context, or you want to retrieve related documentation sections.
