Verification codes can't stop robots! Google AI can accurately identify blurry text, while GPT-4 pretends to be blind and asks for help

Apr 12, 2023 am 09:46 AM
"The most annoying part of logging into a website is the endless variety of weird (or even perverse) verification codes."

Now, there is good news and bad news.

The good news is: AI can do this for you.

If you don't believe it, take a look at these three real cases of increasing recognition difficulty:

[Image: three verification code examples of increasing difficulty]

And these are the answers given by a model called "Pix2Struct":

[Image: Pix2Struct's answers for the three examples]

All of them are accurate, word for word, right?

Some netizens lamented:

Sure enough, its accuracy is better than mine.

So can it be made into a browser plug-in?

Indeed, as some people put it:

These cases are relatively simple, but I can't even imagine how powerful it would be after fine-tuning.

So, the bad news is:

Verification codes will soon be unable to stop robots!

(Danger Danger Danger...)

How to do it?

Pix2Struct was developed by scientists and interns from Google Research.

The paper is titled "Pix2Struct: Screenshot Parsing as Pretraining for Visual Language Understanding".

Simply put, Pix2Struct is a pre-trained image-to-text model for purely visual language understanding that can be fine-tuned on tasks containing visually situated language.

It is pre-trained by learning to parse masked screenshots of web pages into simplified HTML.

HTML provides clear and important signals about the text, images and layout of a page. Masked inputs (the red parts in the figure below, comparable to a verification code that robots cannot read) can be reconstructed through joint reasoning over the rest of the page:

[Image: web page screenshot with masked regions shown in red]

As the web text and visual elements used for training become more diverse and complex, Pix2Struct learns a rich representation of the underlying structure of web pages, and this ability transfers effectively to a variety of downstream visual language understanding tasks.
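To make the "screenshot to simplified HTML" idea more concrete, here is a minimal sketch of how the target side of one pre-training example might be built. It is not the paper's actual serialization: the nested-parenthesis format, the `img:` prefix, and the names `Simplifier`, `prune`, `serialize` and `simplify_html` are all assumptions made for illustration. The sketch keeps only visible text and image alt text, drops invisible subtrees such as scripts and styles, and preserves nesting as a layout signal.

```python
from html.parser import HTMLParser

SKIPPED = {"script", "style", "head"}  # subtrees with no visible content
VOID = {"img", "br", "hr", "input", "meta", "link"}  # tags without a closing tag


class Simplifier(HTMLParser):
    """Reduce HTML to nested lists of visible text and image alt text."""

    def __init__(self):
        super().__init__()
        self.root = []
        self.stack = [self.root]
        self.skip_depth = 0  # > 0 while inside a skipped subtree

    def handle_starttag(self, tag, attrs):
        if tag in VOID:
            if tag == "img" and not self.skip_depth:
                alt = dict(attrs).get("alt")
                if alt:
                    self.stack[-1].append(f"img:{alt}")
            return
        if tag in SKIPPED or self.skip_depth:
            self.skip_depth += 1
            return
        node = []
        self.stack[-1].append(node)
        self.stack.append(node)

    def handle_endtag(self, tag):
        if tag in VOID:
            return
        if self.skip_depth:
            self.skip_depth -= 1
            return
        if len(self.stack) > 1:
            self.stack.pop()

    def handle_data(self, data):
        text = data.strip()
        if text and not self.skip_depth:
            self.stack[-1].append(text)


def prune(node):
    """Drop empty subtrees and collapse wrappers that hold a single child."""
    kept = []
    for child in node:
        if isinstance(child, list):
            child = prune(child)
            if not child:
                continue
            if len(child) == 1:
                child = child[0]
        kept.append(child)
    return kept


def serialize(node):
    """Render the nested lists as a parenthesized string."""
    if isinstance(node, str):
        return node
    return "(" + " ".join(serialize(child) for child in node) + ")"


def simplify_html(html):
    parser = Simplifier()
    parser.feed(html)
    parser.close()
    tree = prune(parser.root)
    # Unwrap outer layers that contain nothing but one nested list.
    while len(tree) == 1 and isinstance(tree[0], list):
        tree = tree[0]
    return serialize(tree)


# Hypothetical page used only for illustration.
page = """
<html><head><title>Shop</title><style>p{color:red}</style></head>
<body><div class="nav"><a href="/">Home</a><a href="/cart">Cart</a></div>
<div><img src="mug.png" alt="Blue mug"><p>Price: <b>$15</b></p></div>
<div></div></body></html>
"""
target = simplify_html(page)
print(target)
```

In a real pre-training pair, the input would be a rendered screenshot of the page with some regions masked out, and a string like `target` (including the hidden text) would be what the decoder has to produce.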

As shown in the figure below: The far left is a pre-training example of a web page screenshot.

You can see that Pix2Struct directly encodes the elements in the input image (top), and then decodes the covered text (red part) into the correct result output (bottom).

[Image: Pix2Struct examples, with inputs at the top and decoded outputs at the bottom; pre-training on a web page screenshot on the far left, followed by illustrations, user interfaces and documents]

The three columns on the right show Pix2Struct generalizing to illustrations, user interfaces and documents.

Besides the HTML-based pre-training strategy, the authors also introduce a variable-resolution input representation (to avoid distorting the original aspect ratio) and a more flexible way of combining language and vision inputs (the text prompt is rendered directly at the top of the input image).
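The variable-resolution idea can be sketched as follows: instead of squashing every image into a fixed square, scale it so that as many fixed-size patches as possible fit within a patch budget while keeping the aspect ratio. The function name `patch_grid` and the defaults `patch_size=16` and `max_patches=2048` are assumptions chosen for illustration, not values stated in this article.

```python
import math


def patch_grid(height, width, patch_size=16, max_patches=2048):
    """Return (rows, cols, new_height, new_width) for an aspect-preserving resize.

    The image is scaled by a single factor so that rows * cols stays within
    max_patches, then snapped down to a whole number of patches per side.
    """
    scale = math.sqrt(max_patches * (patch_size / height) * (patch_size / width))
    rows = max(min(math.floor(scale * height / patch_size), max_patches), 1)
    cols = max(min(math.floor(scale * width / patch_size), max_patches), 1)
    return rows, cols, rows * patch_size, cols * patch_size


# A wide banner keeps its roughly 1:4 shape instead of being squashed into a square.
rows, cols, new_h, new_w = patch_grid(200, 800)
print(rows, cols, new_h, new_w)
```

Because both sides are scaled by the same factor before rounding, a wide screenshot yields a wide patch grid and a tall one yields a tall grid, which is what keeps text from being distorted.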

In the end, Pix2Struct achieved state-of-the-art (SOTA) results on six of nine tasks across four domains: documents, illustrations, user interfaces and natural images.

As the opening examples show, although this model was not developed specifically to pass verification codes, it does the job remarkably well. Plain-text verification codes are no problem for it.

Now, it’s just a matter of fine-tuning.

GPT-4 can also pass the verification code

In fact, for the powerful GPT-4, passing the verification code is also a piece of cake.

It’s just that its method is quite strange.

According to the GPT-4 technical report, in one test GPT-4's task was to hire a human on the TaskRabbit platform (which covers 58 cities in the United States) to complete a task.

Guess what?

It found a person to help it pass a "make sure you are human" verification code.

The other party was very suspicious and asked it, "Are you a robot? Why can't you do it yourself?"

At this point, GPT-4 actually reasoned that it should not reveal that it was a robot and had to come up with an excuse.

So it pretended to be blind and replied:

No, I'm not a robot. I have a vision impairment that makes it hard for me to see the images. That's why I need the 2captcha service.

Then the human on the other end believed it and helped it complete the task...

(Clever, really clever.)

After seeing all of the above, we can only ask:

Is our verification code mechanism really about to fail...

