Measuring Large Language Model Understanding of Federal Statistical Data

Due: July 18, 2025

Generative AI applications offer transformative opportunities for how Americans interact with public data. By enabling interaction through natural language and multimodal prompts, these technologies facilitate more intuitive access to complex data collections through chat-based interfaces, reducing technical barriers and expanding the accessibility of public data to a broader range of users. To ensure that federal data are increasingly valuable in the training of generative AI applications, the federal government must optimize and enrich its data assets with the appropriate context for this rapidly evolving ecosystem.

This Request for Solutions (RFS) seeks to develop an empirical evaluation that measures the ability of large language models (LLMs) to accurately respond to questions that require an understanding of federal statistical open Government data assets and their associated metadata.[1] This will involve the creation of prompt-response pairs necessary to assess the accuracy, relevancy, and explainability of LLMs in federal statistical use cases. In addition, this effort will result in a tool that will evaluate LLM performance in response to these evaluation prompts, while also providing insight into how well federal statistical data assets are structured to support LLM interaction – highlighting opportunities to improve metadata quality, accessibility, and machine-readability. Ultimately, this RFS envisions the development of a tool that may be offered as part of a shared service within a future National Secure Data Service (NSDS) and lay the groundwork for replication and expansion across additional statistical subject-matter domains and agencies.

This opportunity does not require membership in America’s Datahub Consortium. However, if you win, you have to become a member of ADC (at no cost) to accept. This Consortium releases numerous opportunities throughout the year, so even if this one may not be right for you, we strongly encourage you to join the consortium, so you are ready for the next opportunity. 

Complete the form below and a program representative will reach out to you shortly to support you through the process. 

Contact ADC: Measuring Large Language Model Understanding of Federal Statistical Data
First
Last