Table of Contents
Data Observability Continues
Real-time data pop-up
Regulatory Growth
Datasheet format war
Language AI continues to amaze
Home Technology peripherals AI Data and Artificial Intelligence Technology Forecast for the Second Half of 2022

Data and Artificial Intelligence Technology Forecast for the Second Half of 2022

Apr 12, 2023 pm 09:34 PM
database AI data observability

Based on what we've seen so far in 2022, Datanami is confident it can make these five predictions for the rest of the year.

Data and Artificial Intelligence Technology Forecast for the Second Half of 2022

Data Observability Continues

The first half of the year has been huge for data observability, allowing customers to better understand data flows situation and develop relevant indicators. As data becomes more important to decision making, so does the health and availability of that data.

We’ve seen a number of data observability startups raise hundreds of millions of dollars in venture capital, including Cribl ($150 million Series D); Monte Carlo ($135 million Series D) ; Coralogix ($142 million Series D); and others. Other companies making news include Bigeye, which launched metadata metrics; StreamSets, acquired by Software AG for $580 million; and IBM, which acquired observability startup Databand last month.

This momentum will continue in the second half of 2022, as more data observability startups emerge from the woods and existing startups seek to solidify their positions in this emerging market.

Real-time data pop-up

Real-time data has been on the back burner for years, serving niche use cases but not actually being widely used in regular enterprises. However, thanks to the COVID-19 pandemic and the associated restructuring of business plans over the past few years, conditions are now ripe for real-time data to enter the mainstream tech scene.

“I think streaming is finally happening,” Databricks CEO Ali Ghodsi said at the recent Data AI Summit, noting a 2.5x increase in streaming workloads on the company’s cloud-based data platform . “They have more and more AI use cases that require real-time.”

In-memory databases and in-memory data grids are also poised to benefit from a real-time renaissance, if that’s the case. RocksDB, a fast analytics database that enhances event-based systems like Kafka, now has a replacement called Speedb. SingleStore, which combines OLTP and OLAP capabilities in a single relational framework, reached a $1.3 billion valuation in a funding round last month.

There is also StarRocks, which recently received funding for a fast new OLAP database based on Apache Doris; Imply completed a US$100 million Series D financing in May to continue its real-time analysis business based on Apache Druid; DataStax adds Apache Pulsar to its Apache Cassandra toolkit, raising $115 million to advance real-time application development. Datanami expects this focus on real-time data analytics to continue.

Regulatory Growth

It’s been four years since the GDPR came into effect, putting big data users in the spotlight and accelerating the rise of data governance as a necessary component of responsible data initiatives. In the United States, the task of regulating data access has fallen to the states, with California leading the way with the CCPA, which in many ways is modeled after the GPDR. But more states are likely to follow suit, complicating the data privacy equation for U.S. companies.

But GDPR and CCPA are just the beginning of regulations. We're also in the midst of the demise of third-party cookies, which make it harder for companies to track users' online behavior. Google's decision to delay the end of third-party cookies on its platform until January 1, 2023 gives marketers some extra time to adapt, but the information from the cookies will be difficult to replicate.

In addition to data regulations, we are also on the cusp of new regulations regarding the use of artificial intelligence. The EU introduced its Artificial Intelligence Bill in 2021, and experts predict it could become law by the end of 2022 or early 2023.

Datasheet format war

A classic technology war is shaping up new datasheet formats that will determine how data is stored in big data systems, who can access it, and who uses it What can be done with it.

In recent months, Apache Iceberg has gained momentum as a potential new standard for data table formats. Cloud data warehouse giants Snowflake and AWS came out earlier this year to back Iceberg, which provides transactional and other data controls and has emerged from work at Netflix and Apple. Former Hadoop distributor Cloudera also backed Iceberg in June.

But the folks at Databricks offer an alternative to Delta Lake tabular format that provides similar functionality to Iceberg. Apache Spark backers originally developed the Delta Lake tabular format in a proprietary manner, leading to accusations that Databricks was setting a lock-in for customers. But at the Data AI Summit in June, the company announced that it would make the entire format open source, allowing anyone to use it.

Lost in the shuffle is Apache Hudi, which also provides data consistency because it resides in a big data repository and can be accessed by various computing engines. Onehouse, a business backed by the creators of Apache Hudi, launched a Hudi-based Lakehouse platform earlier this year.

The big data ecosystem loves competition, so it will be interesting to watch these formats evolve and compete throughout the rest of 2022.

Language AI continues to amaze

The frontiers of artificial intelligence are getting sharper every month, and today, the spearhead of AI is big language models, which are getting better and better. In fact, large language models have become so good that in June a Google engineer claimed that the company's LaMDA conversational system had become sentient.

Artificial intelligence is not yet sentient, but that doesn’t mean they aren’t useful to businesses. As a reminder, Salesforce has a large language modeling (LLM) project called CodeGen, which is designed to understand source code and even generate its own code in different programming languages.

Last month, Meta (the parent company of Facebook) launched a massive language model that can translate into 200 languages. We’ve also seen efforts to democratize AI through projects like the BigScience Large Open Science Open Access Multilingual Language Model, or BLOOM.

The above is the detailed content of Data and Artificial Intelligence Technology Forecast for the Second Half of 2022. For more information, please follow other related articles on the PHP Chinese website!

Statement of this Website
The content of this article is voluntarily contributed by netizens, and the copyright belongs to the original author. This site does not assume corresponding legal responsibility. If you find any content suspected of plagiarism or infringement, please contact admin@php.cn

Hot AI Tools

Undresser.AI Undress

Undresser.AI Undress

AI-powered app for creating realistic nude photos

AI Clothes Remover

AI Clothes Remover

Online AI tool for removing clothes from photos.

Undress AI Tool

Undress AI Tool

Undress images for free

Clothoff.io

Clothoff.io

AI clothes remover

AI Hentai Generator

AI Hentai Generator

Generate AI Hentai for free.

Hot Article

R.E.P.O. Energy Crystals Explained and What They Do (Yellow Crystal)
3 weeks ago By 尊渡假赌尊渡假赌尊渡假赌
R.E.P.O. Best Graphic Settings
3 weeks ago By 尊渡假赌尊渡假赌尊渡假赌
R.E.P.O. How to Fix Audio if You Can't Hear Anyone
3 weeks ago By 尊渡假赌尊渡假赌尊渡假赌
WWE 2K25: How To Unlock Everything In MyRise
4 weeks ago By 尊渡假赌尊渡假赌尊渡假赌

Hot Tools

Notepad++7.3.1

Notepad++7.3.1

Easy-to-use and free code editor

SublimeText3 Chinese version

SublimeText3 Chinese version

Chinese version, very easy to use

Zend Studio 13.0.1

Zend Studio 13.0.1

Powerful PHP integrated development environment

Dreamweaver CS6

Dreamweaver CS6

Visual web development tools

SublimeText3 Mac version

SublimeText3 Mac version

God-level code editing software (SublimeText3)

Bytedance Cutting launches SVIP super membership: 499 yuan for continuous annual subscription, providing a variety of AI functions Bytedance Cutting launches SVIP super membership: 499 yuan for continuous annual subscription, providing a variety of AI functions Jun 28, 2024 am 03:51 AM

This site reported on June 27 that Jianying is a video editing software developed by FaceMeng Technology, a subsidiary of ByteDance. It relies on the Douyin platform and basically produces short video content for users of the platform. It is compatible with iOS, Android, and Windows. , MacOS and other operating systems. Jianying officially announced the upgrade of its membership system and launched a new SVIP, which includes a variety of AI black technologies, such as intelligent translation, intelligent highlighting, intelligent packaging, digital human synthesis, etc. In terms of price, the monthly fee for clipping SVIP is 79 yuan, the annual fee is 599 yuan (note on this site: equivalent to 49.9 yuan per month), the continuous monthly subscription is 59 yuan per month, and the continuous annual subscription is 499 yuan per year (equivalent to 41.6 yuan per month) . In addition, the cut official also stated that in order to improve the user experience, those who have subscribed to the original VIP

Context-augmented AI coding assistant using Rag and Sem-Rag Context-augmented AI coding assistant using Rag and Sem-Rag Jun 10, 2024 am 11:08 AM

Improve developer productivity, efficiency, and accuracy by incorporating retrieval-enhanced generation and semantic memory into AI coding assistants. Translated from EnhancingAICodingAssistantswithContextUsingRAGandSEM-RAG, author JanakiramMSV. While basic AI programming assistants are naturally helpful, they often fail to provide the most relevant and correct code suggestions because they rely on a general understanding of the software language and the most common patterns of writing software. The code generated by these coding assistants is suitable for solving the problems they are responsible for solving, but often does not conform to the coding standards, conventions and styles of the individual teams. This often results in suggestions that need to be modified or refined in order for the code to be accepted into the application

Seven Cool GenAI & LLM Technical Interview Questions Seven Cool GenAI & LLM Technical Interview Questions Jun 07, 2024 am 10:06 AM

To learn more about AIGC, please visit: 51CTOAI.x Community https://www.51cto.com/aigc/Translator|Jingyan Reviewer|Chonglou is different from the traditional question bank that can be seen everywhere on the Internet. These questions It requires thinking outside the box. Large Language Models (LLMs) are increasingly important in the fields of data science, generative artificial intelligence (GenAI), and artificial intelligence. These complex algorithms enhance human skills and drive efficiency and innovation in many industries, becoming the key for companies to remain competitive. LLM has a wide range of applications. It can be used in fields such as natural language processing, text generation, speech recognition and recommendation systems. By learning from large amounts of data, LLM is able to generate text

Can fine-tuning really allow LLM to learn new things: introducing new knowledge may make the model produce more hallucinations Can fine-tuning really allow LLM to learn new things: introducing new knowledge may make the model produce more hallucinations Jun 11, 2024 pm 03:57 PM

Large Language Models (LLMs) are trained on huge text databases, where they acquire large amounts of real-world knowledge. This knowledge is embedded into their parameters and can then be used when needed. The knowledge of these models is "reified" at the end of training. At the end of pre-training, the model actually stops learning. Align or fine-tune the model to learn how to leverage this knowledge and respond more naturally to user questions. But sometimes model knowledge is not enough, and although the model can access external content through RAG, it is considered beneficial to adapt the model to new domains through fine-tuning. This fine-tuning is performed using input from human annotators or other LLM creations, where the model encounters additional real-world knowledge and integrates it

To provide a new scientific and complex question answering benchmark and evaluation system for large models, UNSW, Argonne, University of Chicago and other institutions jointly launched the SciQAG framework To provide a new scientific and complex question answering benchmark and evaluation system for large models, UNSW, Argonne, University of Chicago and other institutions jointly launched the SciQAG framework Jul 25, 2024 am 06:42 AM

Editor |ScienceAI Question Answering (QA) data set plays a vital role in promoting natural language processing (NLP) research. High-quality QA data sets can not only be used to fine-tune models, but also effectively evaluate the capabilities of large language models (LLM), especially the ability to understand and reason about scientific knowledge. Although there are currently many scientific QA data sets covering medicine, chemistry, biology and other fields, these data sets still have some shortcomings. First, the data form is relatively simple, most of which are multiple-choice questions. They are easy to evaluate, but limit the model's answer selection range and cannot fully test the model's ability to answer scientific questions. In contrast, open-ended Q&A

Five schools of machine learning you don't know about Five schools of machine learning you don't know about Jun 05, 2024 pm 08:51 PM

Machine learning is an important branch of artificial intelligence that gives computers the ability to learn from data and improve their capabilities without being explicitly programmed. Machine learning has a wide range of applications in various fields, from image recognition and natural language processing to recommendation systems and fraud detection, and it is changing the way we live. There are many different methods and theories in the field of machine learning, among which the five most influential methods are called the "Five Schools of Machine Learning". The five major schools are the symbolic school, the connectionist school, the evolutionary school, the Bayesian school and the analogy school. 1. Symbolism, also known as symbolism, emphasizes the use of symbols for logical reasoning and expression of knowledge. This school of thought believes that learning is a process of reverse deduction, through existing

SOTA performance, Xiamen multi-modal protein-ligand affinity prediction AI method, combines molecular surface information for the first time SOTA performance, Xiamen multi-modal protein-ligand affinity prediction AI method, combines molecular surface information for the first time Jul 17, 2024 pm 06:37 PM

Editor | KX In the field of drug research and development, accurately and effectively predicting the binding affinity of proteins and ligands is crucial for drug screening and optimization. However, current studies do not take into account the important role of molecular surface information in protein-ligand interactions. Based on this, researchers from Xiamen University proposed a novel multi-modal feature extraction (MFE) framework, which for the first time combines information on protein surface, 3D structure and sequence, and uses a cross-attention mechanism to compare different modalities. feature alignment. Experimental results demonstrate that this method achieves state-of-the-art performance in predicting protein-ligand binding affinities. Furthermore, ablation studies demonstrate the effectiveness and necessity of protein surface information and multimodal feature alignment within this framework. Related research begins with "S

Laying out markets such as AI, GlobalFoundries acquires Tagore Technology's gallium nitride technology and related teams Laying out markets such as AI, GlobalFoundries acquires Tagore Technology's gallium nitride technology and related teams Jul 15, 2024 pm 12:21 PM

According to news from this website on July 5, GlobalFoundries issued a press release on July 1 this year, announcing the acquisition of Tagore Technology’s power gallium nitride (GaN) technology and intellectual property portfolio, hoping to expand its market share in automobiles and the Internet of Things. and artificial intelligence data center application areas to explore higher efficiency and better performance. As technologies such as generative AI continue to develop in the digital world, gallium nitride (GaN) has become a key solution for sustainable and efficient power management, especially in data centers. This website quoted the official announcement that during this acquisition, Tagore Technology’s engineering team will join GLOBALFOUNDRIES to further develop gallium nitride technology. G

See all articles