Home Technology peripherals AI The U.S. Department of Commerce publicly solicits comments and suggestions on GenAI data preparation

The U.S. Department of Commerce publicly solicits comments and suggestions on GenAI data preparation

Apr 28, 2024 am 11:52 AM
AI generative artificial intelligence data access

Data is the core of artificial intelligence. Without good data, the likelihood of developing useful AI models is slim. With this in mind, the U.S. Department of Commerce last week issued a public request for input on how to better prepare numerous public data sets for building production artificial intelligence (GenAI) models. This public request addresses a critical issue currently facing the field of artificial intelligence: the lack of high-quality, diverse public datasets. These datasets are critical for training machine learning models, fostering innovation, and driving the development of AI applications. Through this public request, the Department hopes to gather input from all parties to better understand how to build and manage public data sets. They want to hear

The U.S. Department of Commerce publicly solicits comments and suggestions on GenAI data preparation

# The U.S. Department of Commerce issued a request for information (RFI) on April 17 asking “industry experts, researchers, civil society organizations and other members of the public” on how to develop “artificial intelligence-enabled open data sets.”

The Department of Commerce calls itself the “America Data Agency” and is responsible for collecting, storing and analyzing a variety of data about the United States, including data on the economy, population and environment. A quick search of commercial data centers reveals more than 122,000 publicly accessible data sets on topics ranging from climate and weather to patents to census information.

This section is intended as technology changes and improves over the years, and the Department has sought assistance from private industry and public agencies to keep its data management and data sharing activities up to current technology standards. Accessing data electronically through machine-readable formats or through web services and APIs are examples of adapting its data services to the times.

Now, with the GenAI revolution upon us, the sector is looking for the most suitable location of data so that it can be used to build artificial intelligence models.

Oliver Wise, chief data officer of the U.S. Department of Commerce, wrote in the information request: "Today, with the emergence of artificial intelligence technology, the Department of Commerce is facing a new technological change, which provides users with Better access to information and data. “Business is particularly interested in generative artificial intelligence (GenAI) applications, which can digest text, images, audio, video and other types of information from different sources to produce new content. GenAI and other artificial intelligence technologies present opportunities and challenges for data providers such as businesses and data users including government entities, industry, academia, and the American people.” Word count not to exceed 482

“Intelligence”. suggests that the biggest challenge facing the business sector is getting human intelligence to work “Recent AI systems are trained on large amounts of digital content and generate responses based on the context of the content.” “However, these systems do not work in a Truly 'understand' text in a meaningful way."

Future artificial intelligence systems must be able to access data that is not only machine-readable, but also "machine-understandable." Today’s AI systems are limited by their reliance on massive unstructured data stores that rely on the underlying data rather than the ability to reason and judge based on understanding.

The Department of Commerce is seeking help in sharing data taking into account these fundamental limitations of GenAI technology. It is looking for new data dissemination standards, including licensing standards, for readable and understandable data. In terms of data accessibility and retrieval, the Department wants suggestions on how to make its data more accessible, such as through APIs or "web crawlers."

There is a particular need in how to use knowledge graphs that leverage metadata to better connect human terms to data. It also hopes to gain direction on the adoption of standard ontologies such as Schema.org or NIEM, and how knowledge graphs can help "harmonize and link" ontologies and vocabularies.

The Department seeks input from the community on how to advance these data standardization efforts while maintaining the highest standards of data integrity, quality, security, and ethics.

Wise is asking interested parties to send their proposals via email with the subject line “AI-Ready Open Data Asset Information Request for Information.” We hope to receive comments or feedback on these topics before July 16.

The above is the detailed content of The U.S. Department of Commerce publicly solicits comments and suggestions on GenAI data preparation. For more information, please follow other related articles on the PHP Chinese website!

Statement of this Website
The content of this article is voluntarily contributed by netizens, and the copyright belongs to the original author. This site does not assume corresponding legal responsibility. If you find any content suspected of plagiarism or infringement, please contact admin@php.cn

Hot AI Tools

Undresser.AI Undress

Undresser.AI Undress

AI-powered app for creating realistic nude photos

AI Clothes Remover

AI Clothes Remover

Online AI tool for removing clothes from photos.

Undress AI Tool

Undress AI Tool

Undress images for free

Clothoff.io

Clothoff.io

AI clothes remover

Video Face Swap

Video Face Swap

Swap faces in any video effortlessly with our completely free AI face swap tool!

Hot Tools

Notepad++7.3.1

Notepad++7.3.1

Easy-to-use and free code editor

SublimeText3 Chinese version

SublimeText3 Chinese version

Chinese version, very easy to use

Zend Studio 13.0.1

Zend Studio 13.0.1

Powerful PHP integrated development environment

Dreamweaver CS6

Dreamweaver CS6

Visual web development tools

SublimeText3 Mac version

SublimeText3 Mac version

God-level code editing software (SublimeText3)

Bytedance Cutting launches SVIP super membership: 499 yuan for continuous annual subscription, providing a variety of AI functions Bytedance Cutting launches SVIP super membership: 499 yuan for continuous annual subscription, providing a variety of AI functions Jun 28, 2024 am 03:51 AM

This site reported on June 27 that Jianying is a video editing software developed by FaceMeng Technology, a subsidiary of ByteDance. It relies on the Douyin platform and basically produces short video content for users of the platform. It is compatible with iOS, Android, and Windows. , MacOS and other operating systems. Jianying officially announced the upgrade of its membership system and launched a new SVIP, which includes a variety of AI black technologies, such as intelligent translation, intelligent highlighting, intelligent packaging, digital human synthesis, etc. In terms of price, the monthly fee for clipping SVIP is 79 yuan, the annual fee is 599 yuan (note on this site: equivalent to 49.9 yuan per month), the continuous monthly subscription is 59 yuan per month, and the continuous annual subscription is 499 yuan per year (equivalent to 41.6 yuan per month) . In addition, the cut official also stated that in order to improve the user experience, those who have subscribed to the original VIP

Context-augmented AI coding assistant using Rag and Sem-Rag Context-augmented AI coding assistant using Rag and Sem-Rag Jun 10, 2024 am 11:08 AM

Improve developer productivity, efficiency, and accuracy by incorporating retrieval-enhanced generation and semantic memory into AI coding assistants. Translated from EnhancingAICodingAssistantswithContextUsingRAGandSEM-RAG, author JanakiramMSV. While basic AI programming assistants are naturally helpful, they often fail to provide the most relevant and correct code suggestions because they rely on a general understanding of the software language and the most common patterns of writing software. The code generated by these coding assistants is suitable for solving the problems they are responsible for solving, but often does not conform to the coding standards, conventions and styles of the individual teams. This often results in suggestions that need to be modified or refined in order for the code to be accepted into the application

Can fine-tuning really allow LLM to learn new things: introducing new knowledge may make the model produce more hallucinations Can fine-tuning really allow LLM to learn new things: introducing new knowledge may make the model produce more hallucinations Jun 11, 2024 pm 03:57 PM

Large Language Models (LLMs) are trained on huge text databases, where they acquire large amounts of real-world knowledge. This knowledge is embedded into their parameters and can then be used when needed. The knowledge of these models is "reified" at the end of training. At the end of pre-training, the model actually stops learning. Align or fine-tune the model to learn how to leverage this knowledge and respond more naturally to user questions. But sometimes model knowledge is not enough, and although the model can access external content through RAG, it is considered beneficial to adapt the model to new domains through fine-tuning. This fine-tuning is performed using input from human annotators or other LLM creations, where the model encounters additional real-world knowledge and integrates it

To provide a new scientific and complex question answering benchmark and evaluation system for large models, UNSW, Argonne, University of Chicago and other institutions jointly launched the SciQAG framework To provide a new scientific and complex question answering benchmark and evaluation system for large models, UNSW, Argonne, University of Chicago and other institutions jointly launched the SciQAG framework Jul 25, 2024 am 06:42 AM

Editor |ScienceAI Question Answering (QA) data set plays a vital role in promoting natural language processing (NLP) research. High-quality QA data sets can not only be used to fine-tune models, but also effectively evaluate the capabilities of large language models (LLM), especially the ability to understand and reason about scientific knowledge. Although there are currently many scientific QA data sets covering medicine, chemistry, biology and other fields, these data sets still have some shortcomings. First, the data form is relatively simple, most of which are multiple-choice questions. They are easy to evaluate, but limit the model's answer selection range and cannot fully test the model's ability to answer scientific questions. In contrast, open-ended Q&A

SOTA performance, Xiamen multi-modal protein-ligand affinity prediction AI method, combines molecular surface information for the first time SOTA performance, Xiamen multi-modal protein-ligand affinity prediction AI method, combines molecular surface information for the first time Jul 17, 2024 pm 06:37 PM

Editor | KX In the field of drug research and development, accurately and effectively predicting the binding affinity of proteins and ligands is crucial for drug screening and optimization. However, current studies do not take into account the important role of molecular surface information in protein-ligand interactions. Based on this, researchers from Xiamen University proposed a novel multi-modal feature extraction (MFE) framework, which for the first time combines information on protein surface, 3D structure and sequence, and uses a cross-attention mechanism to compare different modalities. feature alignment. Experimental results demonstrate that this method achieves state-of-the-art performance in predicting protein-ligand binding affinities. Furthermore, ablation studies demonstrate the effectiveness and necessity of protein surface information and multimodal feature alignment within this framework. Related research begins with "S

SK Hynix will display new AI-related products on August 6: 12-layer HBM3E, 321-high NAND, etc. SK Hynix will display new AI-related products on August 6: 12-layer HBM3E, 321-high NAND, etc. Aug 01, 2024 pm 09:40 PM

According to news from this site on August 1, SK Hynix released a blog post today (August 1), announcing that it will attend the Global Semiconductor Memory Summit FMS2024 to be held in Santa Clara, California, USA from August 6 to 8, showcasing many new technologies. generation product. Introduction to the Future Memory and Storage Summit (FutureMemoryandStorage), formerly the Flash Memory Summit (FlashMemorySummit) mainly for NAND suppliers, in the context of increasing attention to artificial intelligence technology, this year was renamed the Future Memory and Storage Summit (FutureMemoryandStorage) to invite DRAM and storage vendors and many more players. New product SK hynix launched last year

Laying out markets such as AI, GlobalFoundries acquires Tagore Technology's gallium nitride technology and related teams Laying out markets such as AI, GlobalFoundries acquires Tagore Technology's gallium nitride technology and related teams Jul 15, 2024 pm 12:21 PM

According to news from this website on July 5, GlobalFoundries issued a press release on July 1 this year, announcing the acquisition of Tagore Technology’s power gallium nitride (GaN) technology and intellectual property portfolio, hoping to expand its market share in automobiles and the Internet of Things. and artificial intelligence data center application areas to explore higher efficiency and better performance. As technologies such as generative AI continue to develop in the digital world, gallium nitride (GaN) has become a key solution for sustainable and efficient power management, especially in data centers. This website quoted the official announcement that during this acquisition, Tagore Technology’s engineering team will join GLOBALFOUNDRIES to further develop gallium nitride technology. G

A new era of VSCode front-end development: 12 highly recommended AI code assistants A new era of VSCode front-end development: 12 highly recommended AI code assistants Jun 11, 2024 pm 07:47 PM

In the world of front-end development, VSCode has become the tool of choice for countless developers with its powerful functions and rich plug-in ecosystem. In recent years, with the rapid development of artificial intelligence technology, AI code assistants on VSCode have sprung up, greatly improving developers' coding efficiency. AI code assistants on VSCode have sprung up like mushrooms after a rain, greatly improving developers' coding efficiency. It uses artificial intelligence technology to intelligently analyze code and provide precise code completion, automatic error correction, grammar checking and other functions, which greatly reduces developers' errors and tedious manual work during the coding process. Today, I will recommend 12 VSCode front-end development AI code assistants to help you in your programming journey.

See all articles