Have you ever made these fatal mistakes in AI projects?-AI-php.cn

Table of Contents

1. Better understand the data

2. Stay data aware to avoid failure

3. Too much wrong data and insufficient correct data are killing AI projects

Home

Technology peripherals

Have you ever made these fatal mistakes in AI projects?

WBOYWBOYWBOYWBOYWBOYWBOYWBOYWBOYWBOYWBOYWBOYWBOYWB

Apr 20, 2023 am 08:10 AM

project ai data

Translator|Bugatti

Reviewer|Sun Shujuan

Since data is the core of artificial intelligence (AI), AI and machine learning (ML) It’s no surprise that systems need enough good data to learn. Large amounts of high-quality data are generally required, especially for supervised learning methods, to properly train an AI or ML system. How much data is required depends on the model of AI being implemented, the algorithms used, and other factors such as internal data and third-party data. For example, neural networks require large amounts of data to train, while decision trees or Bayesian classifiers do not require as much data to obtain high-quality results.

So, you may think that the more data, the better, right? Please think again. Organizations with large amounts of data (even exabytes of data) realize that having more data does not solve the problem as expected. Indeed, with more data comes more questions. The more data you have, the more data you need to clean and prepare, the more data you need to label and manage, the more data you need to secure, protect, reduce bias and other measures. When you start increasing the amount of data, small projects can quickly turn into large projects. In fact, large amounts of data often kill projects.

Clearly the missing step between identifying a business problem and organizing data to solve that problem is determining what data is needed and how much of it is actually needed. You need enough data, but don’t have too much: no more, no less, just right. Unfortunately, organizations often jump into AI projects without understanding the data. Organizations need to answer many questions, including figuring out where the data is, how much data it already has, what state it is in, which characteristics of the data are most important, internal and external uses of the data, data access challenges, requirements to enhance existing data, and other key factors and questions. Without answering these questions, AI projects may fail or even drown in a sea of data.

1. Better understand the data

In order to understand how much data you need, you must first understand where the data is in the structure of the AI project s position. One visual way to help us understand the increasing value we get from data is the "DIKUW Pyramid" (sometimes called the "DIKW Pyramid"), which shows how the data foundation can be transformed through information, knowledge, understanding and wisdom. Help get greater value.

With a solid data foundation, you can gain deeper insights at the next layer of information, which can help you answer fundamental questions about that data. Once you've made basic connections between data to gain information insights, you can find patterns in that information and understand how the pieces of information connect together to gain deeper insights. Organizations can gain more value by building on the knowledge layer and understanding why these patterns occur, helping to understand the underlying patterns. Finally, you can get the most value from information at the intelligence level by deeply understanding the cause and effect of information decisions.

This recent wave of AI focuses most on the knowledge layer, as machine learning provides insights to identify patterns on top of the information layer. Unfortunately, machine learning hits a bottleneck at the understanding layer, because finding patterns is not enough to make inferences. We have machine learning, but we don’t have machine reasoning to understand why patterns occur. You see this limitation every time you interact with a chatbot. While machine learning-based natural language processing (NLP) is very good at understanding human speech and inferring intent, it encounters limitations when trying to understand and reason. For example, if you ask your voice assistant if you want to wear a raincoat tomorrow, it doesn't understand that you're asking about the weather. It's up to humans to provide this insight to machines because the voice assistant has no idea what rain actually is.

2. Stay data aware to avoid failure

Big data has taught us how to handle large amounts of data. Not just how the data is stored, but how all that data is processed, manipulated and analyzed. Machine learning adds even more value by processing the different types of unstructured, semi-structured or structured data that organizations collect. Indeed, this recent wave of AI is actually a wave of big data-driven analytics.

But it’s for this very reason that some organizations are taking a big hit when it comes to AI. Rather than running AI projects from a data-centric perspective, they focus on the functional aspects. To navigate AI projects and avoid fatal mistakes, organizations must better understand not only AI and machine learning, but also the several “Vs” of big data. It’s not just about how much data there is, but also about the nature of the data. Some of the V’s of big data include:

Quantity: The absolute amount of big data owned.
Speed: The speed at which big data changes. Successfully using AI means applying AI to high-speed data.
Diversity: Data can come in many different formats, including structured data like databases, semi-structured data like invoices, and unstructured data like emails, images, and video files. Successful AI systems can handle this diversity.
Authenticity: This refers to the quality and accuracy of the data and how much you trust that data. Garbage in, garbage out, especially in data-driven AI systems. Therefore, successful AI systems need to be able to handle widely varying data quality.

With decades of experience managing big data projects, organizations that are successful in AI have primarily been successful in big data. Organizations that have seen AI projects fail often approach AI problems with an application development mindset.

3. Too much wrong data and insufficient correct data are killing AI projects

Although the AI project started correctly, the lack of necessary data, lack of understanding, and lack of Solving real problems is killing AI projects. Organizations continue to move forward without a true understanding of the data and data quality required, which creates real challenges.

One of the reasons organizations make this data mistake is that they don’t have any real approach to AI projects other than using agile or application development methodologies. Yet successful organizations have realized that using a data-centric approach includes data understanding as the first stage of a project approach. The CRISP-DM approach, which has been around for more than 20 years, specifies data understanding as the next step after business needs are identified. Based on CRISP-DM and combined with agile methods, the Cognitive Project Management with AI (CPMAI) approach requires data understanding in the second phase. Other successful approaches also require understanding the data early in the project, because AI projects are, after all, data projects. How do you build a successful program on data if you approach it without understanding the data? This is definitely a fatal mistake you want to avoid.

Original link: https://www.forbes.com/sites/cognitiveworld/2022/08/20/are-you-making-these-deadly-mistakes-with-your -ai-projects/?sh=352955946b54

The above is the detailed content of Have you ever made these fatal mistakes in AI projects?. For more information, please follow other related articles on the PHP Chinese website!

Statement of this Website

The content of this article is voluntarily contributed by netizens, and the copyright belongs to the original author. This site does not assume corresponding legal responsibility. If you find any content suspected of plagiarism or infringement, please contact admin@php.cn

Hot AI Tools

Undresser.AI Undress

AI-powered app for creating realistic nude photos

AI Clothes Remover

Online AI tool for removing clothes from photos.

Undress AI Tool

Undress images for free

Clothoff.io

AI clothes remover

Video Face Swap

Swap faces in any video effortlessly with our completely free AI face swap tool!

Hot Article

Assassin's Creed Shadows: Seashell Riddle Solution

3 weeks ago By DDD

What's New in Windows 11 KB5054979 & How to Fix Update Issues

2 weeks ago By DDD

Where to find the Crane Control Keycard in Atomfall

3 weeks ago By DDD

Assassin's Creed Shadows - How To Find The Blacksmith And Unlock Weapon And Armour Customisation

1 months ago By DDD

Roblox: Dead Rails - How To Complete Every Challenge

3 weeks ago By DDD

Hot Tools

Notepad++7.3.1

Easy-to-use and free code editor

SublimeText3 Chinese version

Chinese version, very easy to use

Zend Studio 13.0.1

Powerful PHP integrated development environment

Dreamweaver CS6

Visual web development tools

SublimeText3 Mac version

God-level code editing software (SublimeText3)

Hot Topics

Where is the login entrance for gmail email?

7599

CakePHP Tutorial

1386

What is the format of the account name of steam

win11 activation key permanent

nyt connections hints and answers

123

Related knowledge

How to solve SQL parsing problem? Use greenlion/php-sql-parser! Apr 17, 2025 pm 09:15 PM

When developing a project that requires parsing SQL statements, I encountered a tricky problem: how to efficiently parse MySQL's SQL statements and extract the key information. After trying many methods, I found that the greenlion/php-sql-parser library can perfectly solve my needs.

How to solve the problem of PHP project code coverage reporting? Using php-coveralls is OK! Apr 17, 2025 pm 08:03 PM

When developing PHP projects, ensuring code coverage is an important part of ensuring code quality. However, when I was using TravisCI for continuous integration, I encountered a problem: the test coverage report was not uploaded to the Coveralls platform, resulting in the inability to monitor and improve code coverage. After some exploration, I found the tool php-coveralls, which not only solved my problem, but also greatly simplified the configuration process.

How to solve complex BelongsToThrough relationship problem in Laravel? Use Composer! Apr 17, 2025 pm 09:54 PM

In Laravel development, dealing with complex model relationships has always been a challenge, especially when it comes to multi-level BelongsToThrough relationships. Recently, I encountered this problem in a project dealing with a multi-level model relationship, where traditional HasManyThrough relationships fail to meet the needs, resulting in data queries becoming complex and inefficient. After some exploration, I found the library staudenmeir/belongs-to-through, which easily installed and solved my troubles through Composer.

How to solve the complexity of WordPress installation and update using Composer Apr 17, 2025 pm 10:54 PM

When managing WordPress websites, you often encounter complex operations such as installation, update, and multi-site conversion. These operations are not only time-consuming, but also prone to errors, causing the website to be paralyzed. Combining the WP-CLI core command with Composer can greatly simplify these tasks, improve efficiency and reliability. This article will introduce how to use Composer to solve these problems and improve the convenience of WordPress management.

How to solve the complex problem of PHP geodata processing? Use Composer and GeoPHP! Apr 17, 2025 pm 08:30 PM

When developing a Geographic Information System (GIS), I encountered a difficult problem: how to efficiently handle various geographic data formats such as WKT, WKB, GeoJSON, etc. in PHP. I've tried multiple methods, but none of them can effectively solve the conversion and operational issues between these formats. Finally, I found the GeoPHP library, which easily integrates through Composer, and it completely solved my troubles.

git software installation tutorial Apr 17, 2025 pm 12:06 PM

Git Software Installation Guide: Visit the official Git website to download the installer for Windows, MacOS, or Linux. Run the installer and follow the prompts. Configure Git: Set username, email, and select a text editor. For Windows users, configure the Git Bash environment.

How to solve the problem of virtual columns in Laravel model? Use stancl/virtualcolumn! Apr 17, 2025 pm 09:48 PM

During Laravel development, it is often necessary to add virtual columns to the model to handle complex data logic. However, adding virtual columns directly into the model can lead to complexity of database migration and maintenance. After I encountered this problem in my project, I successfully solved this problem by using the stancl/virtualcolumn library. This library not only simplifies the management of virtual columns, but also improves the maintainability and efficiency of the code.

The latest tutorial on how to read the key of git software Apr 17, 2025 pm 12:12 PM

This article will explain in detail how to view keys in Git software. It is crucial to master this because Git keys are secure credentials for authentication and secure transfer of code. The article will guide readers step by step how to display and manage their Git keys, including SSH and GPG keys, using different commands and options. By following the steps in this guide, users can easily ensure their Git repository is secure and collaboratively smoothly with others.

See all articles