MLOps: Are businesses repeating the same DIY mistakes?-AI-php.cn

Table of Contents

Opening

Translator Introduction

Home

Technology peripherals

MLOps: Are businesses repeating the same DIY mistakes?

PHPz

Apr 08, 2023 pm 02:11 PM

AI cloud computing machine learning

Translator | Cui Hao

Reviewer | Sun Shujuan

Opening

MLOps: Are businesses repeating the same DIY mistakes?

## Generally speaking, companies will not take the initiative There are reasons for building your own cloud computing infrastructure. Over the past decade, IT infrastructure teams have attempted to build their own private clouds because they believed they would support their businesses more cost-effectively than public clouds. But contrary to expectations, the time and cost spent on the private cloud exceeded expectations. After the private cloud was built, more resources were needed to maintain it, and it was slightly inferior to the public cloud in terms of security and expansion. As a result, companies that build their own private clouds end up not having more resources to invest in core business, but instead invest a lot of time and personnel in infrastructure that cannot expand business needs.

Now, many enterprises generate solutions through various open source tools (such as Apache Spark), but most actions for MLOps require repeated manual operations.

This results in model deployments taking weeks or even months, inefficient run times (measured by computation and inference taking time to run), and a lack of observations for model testing and monitoring. Also, the approach used was too customized to provide scalable, reusable business processes for multiple use cases in different parts of the enterprise.

A case of misdiagnosed problem

In addition, through conversations with business line leaders and chief data analytics officers, we came to the conclusion that although the organization hired many data scientists, they did not look at to any return. As the research deepens, they will continue to ask various questions and use these questions to identify the difficulties and obstacles faced by artificial intelligence. They quickly realized that the key issue was in the “last mile” – deploying the models and applying them to real-time data, executing them efficiently so that the benefits outweighed the costs, and thus their performance could be better measured.

To solve business problems and make business decisions, data scientists transform data into models. This process requires two sets of skills: first, the expertise and skills needed to build great models; second, the skills to use code to drive the model in the real world, while monitoring and updating the model. However, these two types of skills are completely different.

It is precisely because of this difference that ML engineers come into play. ML Engineers integrate tools and frameworks to ensure data, pipelines, and infrastructure work together to produce ML models at scale.

So, what to do now? Hire more machine learning engineers?

Even with the best ML engineers, enterprises still face two major problems when scaling AI:

There is a lack of repeatable, scalable best practices for deploying models no matter where and how they are built: the modern enterprise data ecosystem The reality is that different business units use different data platforms based on their data and technology requirements (for example, product teams may need to support streaming data, while finance needs to provide a simple query interface for non-technical users). In addition, data science also requires decentralizing applications across business units rather than centralizing applications. In other words, different data science teams have a unique set of model training frameworks for the use cases (domains) they focus on, which means that a one-size-fits-all training framework cannot be established for the entire enterprise (including multiple departments/domains) of.

How to get the most value from artificial intelligence

In order to improve automation capabilities; in order to provide large-scale user personalized experiences; in order to deliver more accurate, more granular and predictable users Promised, companies are already investing billions of dollars into artificial intelligence. But so far, there’s been a huge gap between AI’s promise and results, with only about 10% of AI investments generating significant ROI.

Finally, in order to solve the MLOps problem, chief data analytics officers need to build their own capabilities around data science at the core of the business, while also investing in other technologies related to MLOps automation. This is a common "build vs. buy" dilemma. It is not only considered from an operational perspective (cost-benefit), but also needs to consider the speed and efficiency of AI investment permeating throughout the enterprise, and whether new methods can be generated in a better way. revenue products and customer base, or cut costs by increasing automation and reducing waste.

Translator Introduction

Cui Hao, 51CTO community editor and senior architect, has 18 years of software development and architecture experience and 10 years of distributed architecture experience. Formerly a technical expert at HP. He is willing to share and has written many popular technical articles with more than 600,000 reads. Author of "Principles and Practice of Distributed Architecture".

Original title:MLOps | Is the Enterprise Repeating the Same DIY Mistakes?

The above is the detailed content of MLOps: Are businesses repeating the same DIY mistakes?. For more information, please follow other related articles on the PHP Chinese website!

Statement of this Website

The content of this article is voluntarily contributed by netizens, and the copyright belongs to the original author. This site does not assume corresponding legal responsibility. If you find any content suspected of plagiarism or infringement, please contact admin@php.cn

Hot AI Tools

Undresser.AI Undress

AI-powered app for creating realistic nude photos

AI Clothes Remover

Online AI tool for removing clothes from photos.

Undress AI Tool

Undress images for free

Clothoff.io

AI clothes remover

AI Hentai Generator

Generate AI Hentai for free.

Hot Article

Repo: How To Revive Teammates

1 months ago By 尊渡假赌尊渡假赌尊渡假赌

R.E.P.O. Energy Crystals Explained and What They Do (Yellow Crystal)

2 weeks ago By 尊渡假赌尊渡假赌尊渡假赌

Hello Kitty Island Adventure: How To Get Giant Seeds

1 months ago By 尊渡假赌尊渡假赌尊渡假赌

How Long Does It Take To Beat Split Fiction?

4 weeks ago By DDD

R.E.P.O. Save File Location: Where Is It & How to Protect It?

4 weeks ago By DDD

Hot Tools

Notepad++7.3.1

Easy-to-use and free code editor

SublimeText3 Chinese version

Chinese version, very easy to use

Zend Studio 13.0.1

Powerful PHP integrated development environment

Dreamweaver CS6

Visual web development tools

SublimeText3 Mac version

God-level code editing software (SublimeText3)

Hot Topics

Where is the login entrance for gmail email?

7365

Java Tutorial

1628

CakePHP Tutorial

1353

Laravel Tutorial

1266

PHP Tutorial

1214

Related knowledge

Bytedance Cutting launches SVIP super membership: 499 yuan for continuous annual subscription, providing a variety of AI functions Jun 28, 2024 am 03:51 AM

This site reported on June 27 that Jianying is a video editing software developed by FaceMeng Technology, a subsidiary of ByteDance. It relies on the Douyin platform and basically produces short video content for users of the platform. It is compatible with iOS, Android, and Windows. , MacOS and other operating systems. Jianying officially announced the upgrade of its membership system and launched a new SVIP, which includes a variety of AI black technologies, such as intelligent translation, intelligent highlighting, intelligent packaging, digital human synthesis, etc. In terms of price, the monthly fee for clipping SVIP is 79 yuan, the annual fee is 599 yuan (note on this site: equivalent to 49.9 yuan per month), the continuous monthly subscription is 59 yuan per month, and the continuous annual subscription is 499 yuan per year (equivalent to 41.6 yuan per month) . In addition, the cut official also stated that in order to improve the user experience, those who have subscribed to the original VIP

Context-augmented AI coding assistant using Rag and Sem-Rag Jun 10, 2024 am 11:08 AM

Improve developer productivity, efficiency, and accuracy by incorporating retrieval-enhanced generation and semantic memory into AI coding assistants. Translated from EnhancingAICodingAssistantswithContextUsingRAGandSEM-RAG, author JanakiramMSV. While basic AI programming assistants are naturally helpful, they often fail to provide the most relevant and correct code suggestions because they rely on a general understanding of the software language and the most common patterns of writing software. The code generated by these coding assistants is suitable for solving the problems they are responsible for solving, but often does not conform to the coding standards, conventions and styles of the individual teams. This often results in suggestions that need to be modified or refined in order for the code to be accepted into the application

Seven Cool GenAI & LLM Technical Interview Questions Jun 07, 2024 am 10:06 AM

To learn more about AIGC, please visit: 51CTOAI.x Community https://www.51cto.com/aigc/Translator|Jingyan Reviewer|Chonglou is different from the traditional question bank that can be seen everywhere on the Internet. These questions It requires thinking outside the box. Large Language Models (LLMs) are increasingly important in the fields of data science, generative artificial intelligence (GenAI), and artificial intelligence. These complex algorithms enhance human skills and drive efficiency and innovation in many industries, becoming the key for companies to remain competitive. LLM has a wide range of applications. It can be used in fields such as natural language processing, text generation, speech recognition and recommendation systems. By learning from large amounts of data, LLM is able to generate text

Can fine-tuning really allow LLM to learn new things: introducing new knowledge may make the model produce more hallucinations Jun 11, 2024 pm 03:57 PM

Large Language Models (LLMs) are trained on huge text databases, where they acquire large amounts of real-world knowledge. This knowledge is embedded into their parameters and can then be used when needed. The knowledge of these models is "reified" at the end of training. At the end of pre-training, the model actually stops learning. Align or fine-tune the model to learn how to leverage this knowledge and respond more naturally to user questions. But sometimes model knowledge is not enough, and although the model can access external content through RAG, it is considered beneficial to adapt the model to new domains through fine-tuning. This fine-tuning is performed using input from human annotators or other LLM creations, where the model encounters additional real-world knowledge and integrates it

Cloud computing giant launches legal battle: Amazon sues Nokia for patent infringement Jul 31, 2024 pm 12:47 PM

According to news from this site on July 31, technology giant Amazon sued Finnish telecommunications company Nokia in the federal court of Delaware on Tuesday, accusing it of infringing on more than a dozen Amazon patents related to cloud computing technology. 1. Amazon stated in the lawsuit that Nokia abused Amazon Cloud Computing Service (AWS) related technologies, including cloud computing infrastructure, security and performance technologies, to enhance its own cloud service products. Amazon launched AWS in 2006 and its groundbreaking cloud computing technology had been developed since the early 2000s, the complaint said. "Amazon is a pioneer in cloud computing, and now Nokia is using Amazon's patented cloud computing innovations without permission," the complaint reads. Amazon asks court for injunction to block

To provide a new scientific and complex question answering benchmark and evaluation system for large models, UNSW, Argonne, University of Chicago and other institutions jointly launched the SciQAG framework Jul 25, 2024 am 06:42 AM

Editor |ScienceAI Question Answering (QA) data set plays a vital role in promoting natural language processing (NLP) research. High-quality QA data sets can not only be used to fine-tune models, but also effectively evaluate the capabilities of large language models (LLM), especially the ability to understand and reason about scientific knowledge. Although there are currently many scientific QA data sets covering medicine, chemistry, biology and other fields, these data sets still have some shortcomings. First, the data form is relatively simple, most of which are multiple-choice questions. They are easy to evaluate, but limit the model's answer selection range and cannot fully test the model's ability to answer scientific questions. In contrast, open-ended Q&A

SOTA performance, Xiamen multi-modal protein-ligand affinity prediction AI method, combines molecular surface information for the first time Jul 17, 2024 pm 06:37 PM

Editor | KX In the field of drug research and development, accurately and effectively predicting the binding affinity of proteins and ligands is crucial for drug screening and optimization. However, current studies do not take into account the important role of molecular surface information in protein-ligand interactions. Based on this, researchers from Xiamen University proposed a novel multi-modal feature extraction (MFE) framework, which for the first time combines information on protein surface, 3D structure and sequence, and uses a cross-attention mechanism to compare different modalities. feature alignment. Experimental results demonstrate that this method achieves state-of-the-art performance in predicting protein-ligand binding affinities. Furthermore, ablation studies demonstrate the effectiveness and necessity of protein surface information and multimodal feature alignment within this framework. Related research begins with "S

SK Hynix will display new AI-related products on August 6: 12-layer HBM3E, 321-high NAND, etc. Aug 01, 2024 pm 09:40 PM

According to news from this site on August 1, SK Hynix released a blog post today (August 1), announcing that it will attend the Global Semiconductor Memory Summit FMS2024 to be held in Santa Clara, California, USA from August 6 to 8, showcasing many new technologies. generation product. Introduction to the Future Memory and Storage Summit (FutureMemoryandStorage), formerly the Flash Memory Summit (FlashMemorySummit) mainly for NAND suppliers, in the context of increasing attention to artificial intelligence technology, this year was renamed the Future Memory and Storage Summit (FutureMemoryandStorage) to invite DRAM and storage vendors and many more players. New product SK hynix launched last year

See all articles