When it comes to AI infrastructure, should you choose edge or cloud?
Nowadays, edge computing has been a hot topic. Hailed as the most exciting technological shift in recent years, there’s a lot of talk about its transformative power!
With increasingly powerful AI/ML algorithms redefining “intelligence” and the availability of cheaper, more powerful “edge” devices, the hype is largely real. But if you think about the history of edge computing, it's much earlier than recent interest would lead us to believe.
In fact, computing and intelligence originally started at the edge, when high-bandwidth network connectivity was almost non-existent for most applications. Even in the late 1990s, critical measurement equipment deployed remotely in factories or fields often had dedicated computing power to process incoming sensor data. However, the algorithms in these devices are only rudimentary in terms of "intelligence": primarily signal processing or data conversion.
Cloud-based computing began to gain traction in the late 2000s as network capabilities improved and connectivity increased. At the same time, powerful artificial intelligence algorithms are gaining traction as a means of unlocking meaningful information from large amounts of structured and unstructured data. In just a decade, cloud AI has become the go-to choice for AI applications.
However, the shift to the cloud also brings several issues: data upload and download costs, network reliability and data security, etc. At the same time, edge computing’s trade-offs between processing power and cost or floor space are diminishing with the rise of affordable yet powerful edge devices. It seems we are now back to considering “edge computing” as a viable and attractive option for building smart applications.
As the debate rages over which is better, edge AI or cloud AI, anyone familiar with both frameworks is likely to respond “It depends.” !". The reason is that edge and cloud infrastructure are not competing, but complementary frameworks.
Both have experienced tremendous development and refinement over the past few years, particularly as a foundation for artificial intelligence development and deployment. As with any technology choice, the choice really comes down to the specific application: goals, value drivers, economics, and any constraints on power consumption, size, and connectivity. Therefore, it’s important to understand the pros and cons of cloud and edge AI before trying to set up the right infrastructure.
Cloud-based AI is an attractive option when looking for flexibility, scalability, and ease of deployment. Today, most cloud service providers offer powerful frameworks for training and deployment of AI models and offer pay-as-you-go packages with little upfront commitment or investment. The cloud offers virtually unlimited computing and storage options, making it particularly suitable for large AI models.
But for real-time applications that require continuous evaluation of sensor or image data, this can become an unwieldy option because they must transmit data back and forth, incurring huge costs. This data transfer also makes the cloud largely unsuitable for low-latency applications that require closed-loop control or instant action.
On the other hand, edge AI is a reasonable choice for real-time data analysis for automatic alarms or closed-loop control. While edge infrastructure does require an upfront investment in edge hardware, operating costs are much lower compared to the cloud.
Today, there are a variety of edge AI hardware options available, including NPUs (Neural Processing Units), TPUs (Tensor Processing Units), and SOCs (Systems on a Chip) with dedicated AI accelerators and SoMs (systems on modules). Low-cost and low-power hardware for artificial intelligence is an active area of research and has the potential to provide better options. On the other hand, AI-based consumer applications must deal with a fairly diverse range of edge devices (phones, tablets, PCs, etc.), making edge deployment a potentially daunting prospect.
As a result, edge infrastructure may not be conducive to rapid prototyping or easy to scale. While the concept of federated learning, distributed training of AI models allows for training and deployment at the edge, the cloud remains the logical choice for training large models that require sufficient computing power.
But the solution doesn’t have to be an either/or choice. As applications transition to more microservices-based architectures, they can be broken down into smaller functions or microservices with their own specific deployment frameworks. So instead of having to choose between cloud and edge, the focus can be on using both optimally for specific applications.
For example, an application might start as a rapid prototype on the cloud. As it evolves, functions that require low latency and real-time decision-making can be moved to the edge, while those that require scale and flexibility can remain in the cloud. Model training or retraining can be managed centrally in the cloud, while some federated learning at the edge can improve accuracy locally. Likewise, sensitive data can be processed at the edge and more general data can be moved to the cloud.
Organizations, developers, and practitioners would be better off thinking of cloud and edge not as distinct alternatives, but as a continuum from edge to cloud, with many different infrastructure options in between. This includes different types of edges – operational edge, network edge, mobile endpoints, etc., as well as different types of distributed processing on the network – private clouds, public clouds, cloudlets, fog computing, etc.
While complexity can be a challenge, finding the right combination of technologies begins to offer organizations a unique opportunity to maximize the value of AI while minimizing costs and risks.
The above is the detailed content of When it comes to AI infrastructure, should you choose edge or cloud?. For more information, please follow other related articles on the PHP Chinese website!

Hot AI Tools

Undresser.AI Undress
AI-powered app for creating realistic nude photos

AI Clothes Remover
Online AI tool for removing clothes from photos.

Undress AI Tool
Undress images for free

Clothoff.io
AI clothes remover

AI Hentai Generator
Generate AI Hentai for free.

Hot Article

Hot Tools

Notepad++7.3.1
Easy-to-use and free code editor

SublimeText3 Chinese version
Chinese version, very easy to use

Zend Studio 13.0.1
Powerful PHP integrated development environment

Dreamweaver CS6
Visual web development tools

SublimeText3 Mac version
God-level code editing software (SublimeText3)

Hot Topics

This site reported on June 27 that Jianying is a video editing software developed by FaceMeng Technology, a subsidiary of ByteDance. It relies on the Douyin platform and basically produces short video content for users of the platform. It is compatible with iOS, Android, and Windows. , MacOS and other operating systems. Jianying officially announced the upgrade of its membership system and launched a new SVIP, which includes a variety of AI black technologies, such as intelligent translation, intelligent highlighting, intelligent packaging, digital human synthesis, etc. In terms of price, the monthly fee for clipping SVIP is 79 yuan, the annual fee is 599 yuan (note on this site: equivalent to 49.9 yuan per month), the continuous monthly subscription is 59 yuan per month, and the continuous annual subscription is 499 yuan per year (equivalent to 41.6 yuan per month) . In addition, the cut official also stated that in order to improve the user experience, those who have subscribed to the original VIP

Improve developer productivity, efficiency, and accuracy by incorporating retrieval-enhanced generation and semantic memory into AI coding assistants. Translated from EnhancingAICodingAssistantswithContextUsingRAGandSEM-RAG, author JanakiramMSV. While basic AI programming assistants are naturally helpful, they often fail to provide the most relevant and correct code suggestions because they rely on a general understanding of the software language and the most common patterns of writing software. The code generated by these coding assistants is suitable for solving the problems they are responsible for solving, but often does not conform to the coding standards, conventions and styles of the individual teams. This often results in suggestions that need to be modified or refined in order for the code to be accepted into the application

To learn more about AIGC, please visit: 51CTOAI.x Community https://www.51cto.com/aigc/Translator|Jingyan Reviewer|Chonglou is different from the traditional question bank that can be seen everywhere on the Internet. These questions It requires thinking outside the box. Large Language Models (LLMs) are increasingly important in the fields of data science, generative artificial intelligence (GenAI), and artificial intelligence. These complex algorithms enhance human skills and drive efficiency and innovation in many industries, becoming the key for companies to remain competitive. LLM has a wide range of applications. It can be used in fields such as natural language processing, text generation, speech recognition and recommendation systems. By learning from large amounts of data, LLM is able to generate text

Large Language Models (LLMs) are trained on huge text databases, where they acquire large amounts of real-world knowledge. This knowledge is embedded into their parameters and can then be used when needed. The knowledge of these models is "reified" at the end of training. At the end of pre-training, the model actually stops learning. Align or fine-tune the model to learn how to leverage this knowledge and respond more naturally to user questions. But sometimes model knowledge is not enough, and although the model can access external content through RAG, it is considered beneficial to adapt the model to new domains through fine-tuning. This fine-tuning is performed using input from human annotators or other LLM creations, where the model encounters additional real-world knowledge and integrates it

According to news from this site on July 31, technology giant Amazon sued Finnish telecommunications company Nokia in the federal court of Delaware on Tuesday, accusing it of infringing on more than a dozen Amazon patents related to cloud computing technology. 1. Amazon stated in the lawsuit that Nokia abused Amazon Cloud Computing Service (AWS) related technologies, including cloud computing infrastructure, security and performance technologies, to enhance its own cloud service products. Amazon launched AWS in 2006 and its groundbreaking cloud computing technology had been developed since the early 2000s, the complaint said. "Amazon is a pioneer in cloud computing, and now Nokia is using Amazon's patented cloud computing innovations without permission," the complaint reads. Amazon asks court for injunction to block

Editor |ScienceAI Question Answering (QA) data set plays a vital role in promoting natural language processing (NLP) research. High-quality QA data sets can not only be used to fine-tune models, but also effectively evaluate the capabilities of large language models (LLM), especially the ability to understand and reason about scientific knowledge. Although there are currently many scientific QA data sets covering medicine, chemistry, biology and other fields, these data sets still have some shortcomings. First, the data form is relatively simple, most of which are multiple-choice questions. They are easy to evaluate, but limit the model's answer selection range and cannot fully test the model's ability to answer scientific questions. In contrast, open-ended Q&A

Editor | KX In the field of drug research and development, accurately and effectively predicting the binding affinity of proteins and ligands is crucial for drug screening and optimization. However, current studies do not take into account the important role of molecular surface information in protein-ligand interactions. Based on this, researchers from Xiamen University proposed a novel multi-modal feature extraction (MFE) framework, which for the first time combines information on protein surface, 3D structure and sequence, and uses a cross-attention mechanism to compare different modalities. feature alignment. Experimental results demonstrate that this method achieves state-of-the-art performance in predicting protein-ligand binding affinities. Furthermore, ablation studies demonstrate the effectiveness and necessity of protein surface information and multimodal feature alignment within this framework. Related research begins with "S

Machine learning is an important branch of artificial intelligence that gives computers the ability to learn from data and improve their capabilities without being explicitly programmed. Machine learning has a wide range of applications in various fields, from image recognition and natural language processing to recommendation systems and fraud detection, and it is changing the way we live. There are many different methods and theories in the field of machine learning, among which the five most influential methods are called the "Five Schools of Machine Learning". The five major schools are the symbolic school, the connectionist school, the evolutionary school, the Bayesian school and the analogy school. 1. Symbolism, also known as symbolism, emphasizes the use of symbols for logical reasoning and expression of knowledge. This school of thought believes that learning is a process of reverse deduction, through existing
