Home Technology peripherals AI Can deep learning be used to replace other image processing algorithms?

Can deep learning be used to replace other image processing algorithms?

Sep 15, 2023 pm 12:13 PM
AI deep learning

With the emergence of deep learning technology, visual image processing has become more and more popular in recent years and has been widely used in various fields. At the same time, a large number of practitioners have emerged. However, many people only use deep learning and think that traditional image processing algorithms are outdated. I once heard someone say that image processing has become so common that traditional algorithms are outdated and the threshold is very low, and anyone can use it. To be honest, when I hear such remarks, sometimes I am really speechless

Can deep learning be used to replace other image processing algorithms?

#I just have some time today and I want to talk about this issue. First, let us analyze, what is image processing mainly used for? No matter which industry image processing is used in, its main functions include identification, classification, positioning, detection, size measurement and visual guidance, etc.

Since some people say that deep learning has replaced traditional image processing technology, then today I Let me give you a few specific cases to see the application of deep learning. At the same time, we should also see whether traditional image processing technology still has its place.

First, let us briefly introduce what the main functions of vision mentioned above are. There is a certain connection between identification and classification. Face recognition, license plate recognition, character recognition, barcode/QR code recognition, product category recognition, fruit recognition, etc. are all image recognition technologies. After the recognition is completed, the recognition results are sometimes given directly, and sometimes classification is required. For example, the products identified on the mixed production line need to be classified and packed

rewritten into Chinese: There are many positioning methods. Sometimes you only need to roughly know the target location, and sometimes you need precise positioning so that the robot can automatically grab it. Detection methods include target detection and defect detection. Target detection usually only needs to know whether the target exists in the scene, while defect detection not only detects whether the defect exists, but also needs to determine the size and category of the defect. The goal of size measurement is very clear, that is, to detect the target object through vision. Whether a specific size meets the requirements. Visual guidance is combined with the robot's automatic grasping. It needs to accurately locate the target position and also determine the specific location of the grab to avoid the target from falling when the robot moves. I will list a few below. A specific case shows how you can use deep learning to achieve it. Since the company does not want these images to be made public, the pictures below only capture a small part of them, and the specific content cannot be determined

All the following examples are actual needs of the company, and the images were also taken on site. Let’s first look at a simple character recognition case. The requirement of this case is to determine whether these characters are correct. 20 characters need to be processed per second. The budget is 20,000 yuan per visual system. There are 100 production lines, and the total cost is 2 million yuan. Do you think it should be done or not? Although 2 million yuan is already a lot of money, each vision system only costs 20,000 yuan. Will it be profitable? In addition, when the host computer recognizes erroneous characters, it needs to automatically eliminate

. A brief analysis shows that 20 products can be detected per second, that is, each product needs to be Completed within 50 milliseconds. At the same time, the host computer needs to send a signal to the rejection mechanism. In order to ensure the stability of the signal, 20 milliseconds need to be reserved. The remaining 30 milliseconds are used for taking pictures and image processing. For deep learning training, the configuration of the industrial computer needs to be considered. In addition, the cost of PLC, rejection device, camera, light source, lens, cabinet and other small accessories need to be considered. How much does manual on-site commissioning cost? What is the total cost?

Can deep learning be used to replace other image processing algorithms?Give another example. The picture below shows welding defect detection. There are many kinds of welding defects. Someone once spent a week using deep learning to train and detect. They told me the test results were great, but then they came back to me a month later and said they couldn't afford the cost and the test results were poor. Think about it, why does this happen?

Give another example. The picture below is an example of barcode recognition. You see that the barcode is very blurry and difficult to identify, right? We can use deep learning to identify it

Can deep learning be used to replace other image processing algorithms?

As an example, let’s take a look at the picture below. We need to check whether the thickness on both sides is consistent and whether there are defects on the surface. So, how can we use deep learning to solve this problem? Someone once debugged on site for half a year, but in the end the customer was not satisfied

Can deep learning be used to replace other image processing algorithms?

How to use deep learning to automatically grasp the robot and detect the tilt angle of the grasped object, so as to adjust the robot's posture?

Can deep learning be used to replace other image processing algorithms?

How to use deep learning to measure the size of a spring that has been used for a period of time to determine whether it is qualified? How to measure other similar parameters of bearings, gears, threads, etc.?

Can deep learning be used to replace other image processing algorithms?

The above examples are only a small part of many actual cases. Similar detection methods are emerging in endlessly, such as automatic bolt tightening, disordered grabbing and micron-level precise positioning. However, unfortunately, many people only use deep learning, and some people assert that image processing is outdated and deep learning has replaced other image processing methods without even understanding the basic image concepts. If you were exposed to more actual cases, you would not say such a thing

Many people mistakenly believe that you only need to input images into the deep learning model for training. If the effect is not good, just increase the training samples or adjust the parameters. can achieve the desired effect. All I can say is that this understanding of images is too superficial. The most common image applications are face recognition and license plate recognition. For these recognition tasks, there is not much problem in using deep learning because the requirements for recognition are not high. Even if the recognition takes a long time or a recognition error occurs, it will not cause too big a problem. For example, with facial recognition payment, if the face cannot be recognized, you can also choose to pay manually; with license plate recognition and access control systems, if the license plate or face cannot be recognized, you can also choose to open the door manually. However, in a fully automatic application scenario, this situation is not allowed.

For the detection, classification and identification of product defects, deep learning is a commonly used method, but it also needs to be decided according to the specific situation. . In addition, before training, it is usually necessary to use other image processing algorithms

Some people just use public data sets to train deep learning models and write articles, which is no problem. However, there is still a long way to go if these models are to be applied in practical applications. A company I was familiar with recruited some Ph.D.s to work on visual inspection, but after half a year there were no results, and they were severely criticized by the leader. Do you know why?

Deep learning has its place in the application field. This is an undeniable fact. However, it is only one part of visual inspection and cannot be implemented in many other aspects. Current visual detection technology can only be applied to some simple scenes. For many complex scenes, no matter what algorithm is used, effective detection cannot be achieved. Therefore, visual image processing algorithms still have a long way to go. When deep learning trains images, it usually requires some processing of the original image, such as filtering, enhancement, threshold segmentation, edge detection and morphological operations. Sometimes, it is also necessary to process the image and then directly extract image features for deep learning training. Many people who are engaged in visual work understand this principle.

It has been mentioned before that if you just want to write a paper, then it is enough to study one direction in depth. In-depth theoretical research is also promising, but it requires higher personal theoretical abilities. If you have not been exposed to actual vision applications, it is best not to casually claim that image processing is outdated, other image processing algorithms are no longer used, and deep learning has replaced other image processing algorithms

So, are we still What about traditional image processing algorithms? I think the answer to this question is clear. If there is still something unclear, please think carefully about how many vision-related projects you have been exposed to, and what else you don’t know. Do you really understand what visual inspection can do and how to do it? Very few visual inspections are achieved solely by relying on a single image processing algorithm. Therefore, at the application level, we need to be proficient in the use of various image processing algorithms and the combined application of various algorithms in order to be able to operate with ease in the visual industry. Summer vacation is coming soon. Summer vacation is a very good time for learning. Use this vacation and this learning platform to quickly master the application of image processing related algorithms

The above is the detailed content of Can deep learning be used to replace other image processing algorithms?. For more information, please follow other related articles on the PHP Chinese website!

Statement of this Website
The content of this article is voluntarily contributed by netizens, and the copyright belongs to the original author. This site does not assume corresponding legal responsibility. If you find any content suspected of plagiarism or infringement, please contact admin@php.cn

Hot AI Tools

Undresser.AI Undress

Undresser.AI Undress

AI-powered app for creating realistic nude photos

AI Clothes Remover

AI Clothes Remover

Online AI tool for removing clothes from photos.

Undress AI Tool

Undress AI Tool

Undress images for free

Clothoff.io

Clothoff.io

AI clothes remover

AI Hentai Generator

AI Hentai Generator

Generate AI Hentai for free.

Hot Article

R.E.P.O. Energy Crystals Explained and What They Do (Yellow Crystal)
3 weeks ago By 尊渡假赌尊渡假赌尊渡假赌
R.E.P.O. Best Graphic Settings
3 weeks ago By 尊渡假赌尊渡假赌尊渡假赌
R.E.P.O. How to Fix Audio if You Can't Hear Anyone
3 weeks ago By 尊渡假赌尊渡假赌尊渡假赌
WWE 2K25: How To Unlock Everything In MyRise
1 months ago By 尊渡假赌尊渡假赌尊渡假赌

Hot Tools

Notepad++7.3.1

Notepad++7.3.1

Easy-to-use and free code editor

SublimeText3 Chinese version

SublimeText3 Chinese version

Chinese version, very easy to use

Zend Studio 13.0.1

Zend Studio 13.0.1

Powerful PHP integrated development environment

Dreamweaver CS6

Dreamweaver CS6

Visual web development tools

SublimeText3 Mac version

SublimeText3 Mac version

God-level code editing software (SublimeText3)

Bytedance Cutting launches SVIP super membership: 499 yuan for continuous annual subscription, providing a variety of AI functions Bytedance Cutting launches SVIP super membership: 499 yuan for continuous annual subscription, providing a variety of AI functions Jun 28, 2024 am 03:51 AM

This site reported on June 27 that Jianying is a video editing software developed by FaceMeng Technology, a subsidiary of ByteDance. It relies on the Douyin platform and basically produces short video content for users of the platform. It is compatible with iOS, Android, and Windows. , MacOS and other operating systems. Jianying officially announced the upgrade of its membership system and launched a new SVIP, which includes a variety of AI black technologies, such as intelligent translation, intelligent highlighting, intelligent packaging, digital human synthesis, etc. In terms of price, the monthly fee for clipping SVIP is 79 yuan, the annual fee is 599 yuan (note on this site: equivalent to 49.9 yuan per month), the continuous monthly subscription is 59 yuan per month, and the continuous annual subscription is 499 yuan per year (equivalent to 41.6 yuan per month) . In addition, the cut official also stated that in order to improve the user experience, those who have subscribed to the original VIP

Context-augmented AI coding assistant using Rag and Sem-Rag Context-augmented AI coding assistant using Rag and Sem-Rag Jun 10, 2024 am 11:08 AM

Improve developer productivity, efficiency, and accuracy by incorporating retrieval-enhanced generation and semantic memory into AI coding assistants. Translated from EnhancingAICodingAssistantswithContextUsingRAGandSEM-RAG, author JanakiramMSV. While basic AI programming assistants are naturally helpful, they often fail to provide the most relevant and correct code suggestions because they rely on a general understanding of the software language and the most common patterns of writing software. The code generated by these coding assistants is suitable for solving the problems they are responsible for solving, but often does not conform to the coding standards, conventions and styles of the individual teams. This often results in suggestions that need to be modified or refined in order for the code to be accepted into the application

Can fine-tuning really allow LLM to learn new things: introducing new knowledge may make the model produce more hallucinations Can fine-tuning really allow LLM to learn new things: introducing new knowledge may make the model produce more hallucinations Jun 11, 2024 pm 03:57 PM

Large Language Models (LLMs) are trained on huge text databases, where they acquire large amounts of real-world knowledge. This knowledge is embedded into their parameters and can then be used when needed. The knowledge of these models is "reified" at the end of training. At the end of pre-training, the model actually stops learning. Align or fine-tune the model to learn how to leverage this knowledge and respond more naturally to user questions. But sometimes model knowledge is not enough, and although the model can access external content through RAG, it is considered beneficial to adapt the model to new domains through fine-tuning. This fine-tuning is performed using input from human annotators or other LLM creations, where the model encounters additional real-world knowledge and integrates it

Seven Cool GenAI & LLM Technical Interview Questions Seven Cool GenAI & LLM Technical Interview Questions Jun 07, 2024 am 10:06 AM

To learn more about AIGC, please visit: 51CTOAI.x Community https://www.51cto.com/aigc/Translator|Jingyan Reviewer|Chonglou is different from the traditional question bank that can be seen everywhere on the Internet. These questions It requires thinking outside the box. Large Language Models (LLMs) are increasingly important in the fields of data science, generative artificial intelligence (GenAI), and artificial intelligence. These complex algorithms enhance human skills and drive efficiency and innovation in many industries, becoming the key for companies to remain competitive. LLM has a wide range of applications. It can be used in fields such as natural language processing, text generation, speech recognition and recommendation systems. By learning from large amounts of data, LLM is able to generate text

To provide a new scientific and complex question answering benchmark and evaluation system for large models, UNSW, Argonne, University of Chicago and other institutions jointly launched the SciQAG framework To provide a new scientific and complex question answering benchmark and evaluation system for large models, UNSW, Argonne, University of Chicago and other institutions jointly launched the SciQAG framework Jul 25, 2024 am 06:42 AM

Editor |ScienceAI Question Answering (QA) data set plays a vital role in promoting natural language processing (NLP) research. High-quality QA data sets can not only be used to fine-tune models, but also effectively evaluate the capabilities of large language models (LLM), especially the ability to understand and reason about scientific knowledge. Although there are currently many scientific QA data sets covering medicine, chemistry, biology and other fields, these data sets still have some shortcomings. First, the data form is relatively simple, most of which are multiple-choice questions. They are easy to evaluate, but limit the model's answer selection range and cannot fully test the model's ability to answer scientific questions. In contrast, open-ended Q&A

Five schools of machine learning you don't know about Five schools of machine learning you don't know about Jun 05, 2024 pm 08:51 PM

Machine learning is an important branch of artificial intelligence that gives computers the ability to learn from data and improve their capabilities without being explicitly programmed. Machine learning has a wide range of applications in various fields, from image recognition and natural language processing to recommendation systems and fraud detection, and it is changing the way we live. There are many different methods and theories in the field of machine learning, among which the five most influential methods are called the "Five Schools of Machine Learning". The five major schools are the symbolic school, the connectionist school, the evolutionary school, the Bayesian school and the analogy school. 1. Symbolism, also known as symbolism, emphasizes the use of symbols for logical reasoning and expression of knowledge. This school of thought believes that learning is a process of reverse deduction, through existing

AlphaFold 3 is launched, comprehensively predicting the interactions and structures of proteins and all living molecules, with far greater accuracy than ever before AlphaFold 3 is launched, comprehensively predicting the interactions and structures of proteins and all living molecules, with far greater accuracy than ever before Jul 16, 2024 am 12:08 AM

Editor | Radish Skin Since the release of the powerful AlphaFold2 in 2021, scientists have been using protein structure prediction models to map various protein structures within cells, discover drugs, and draw a "cosmic map" of every known protein interaction. . Just now, Google DeepMind released the AlphaFold3 model, which can perform joint structure predictions for complexes including proteins, nucleic acids, small molecules, ions and modified residues. The accuracy of AlphaFold3 has been significantly improved compared to many dedicated tools in the past (protein-ligand interaction, protein-nucleic acid interaction, antibody-antigen prediction). This shows that within a single unified deep learning framework, it is possible to achieve

SOTA performance, Xiamen multi-modal protein-ligand affinity prediction AI method, combines molecular surface information for the first time SOTA performance, Xiamen multi-modal protein-ligand affinity prediction AI method, combines molecular surface information for the first time Jul 17, 2024 pm 06:37 PM

Editor | KX In the field of drug research and development, accurately and effectively predicting the binding affinity of proteins and ligands is crucial for drug screening and optimization. However, current studies do not take into account the important role of molecular surface information in protein-ligand interactions. Based on this, researchers from Xiamen University proposed a novel multi-modal feature extraction (MFE) framework, which for the first time combines information on protein surface, 3D structure and sequence, and uses a cross-attention mechanism to compare different modalities. feature alignment. Experimental results demonstrate that this method achieves state-of-the-art performance in predicting protein-ligand binding affinities. Furthermore, ablation studies demonstrate the effectiveness and necessity of protein surface information and multimodal feature alignment within this framework. Related research begins with "S

See all articles