Why do PDF files opened in a PDF viewer display garbled characters?

WBOY
Release: 2024-01-17 16:18:05


Why do PDF files show garbled characters when I open them in a PDF viewer?

I use CAJViewer

CAJViewer5.5_OCR v5.5.0 Build 4030

Description: Includes OCR and a multi-language pack; OCR supports Chinese and English. Size: 32.911 MB

1) Partial text recognition: use CAJViewer's built-in OCR directly.

To recognize the whole document, save the print file in MDI format and open it with Microsoft Office Document Imaging. Select "Recognize Text Using OCR" on the Tools menu to recognize the text. When recognition finishes, select "Send Text to Word" on the Tools menu to output the recognized text of the entire PDF file to a Word file.

Please note: Microsoft Office Document Imaging recognizes and converts Chinese, English, and table content with high accuracy, but it cannot output graphics directly to the Word document. Instead, it saves each graphic in the file as a separate image file in a folder named after the original file. You can open these image files with Snagit and copy and paste them into Word. (No OCR software handles graphics well, and the approach taken by Microsoft Office Document Imaging is already one of the better solutions.)

Recommended quick method:

Before extracting text from CAJ files, make sure that CAJViewer 5.5 and Office 2003 are installed, with the Office tool Microsoft Office Document Imaging fully installed. Once installation is complete, the Microsoft Office Document Image Writer printer appears in the printer list. Microsoft Office Document Imaging can then recognize and convert Chinese, English, tables, and other document content with high accuracy.

Recognizing text in a CAJ file:

(1) Download the CAJ file from the Internet and save it to the local hard disk.

(2) Start CAJViewer and open the CAJ file you just saved. Page through to the end of the file, and do not close CAJViewer.

(3) In the CAJViewer window, select "File" → "Print", choose the Microsoft Office Document Image Writer printer, check the "Print to file" option, and set the range of pages to print.

(4) Save the print file (*.prn) to a suitable location. When printing completes, Microsoft Office Document Imaging automatically opens the print file you just saved.

(5) In the Microsoft Office Document Imaging window, select "Select All Pages" on the "Page" menu, and then select "Recognize Text Using OCR" on the "Tools" menu to extract the text.

(6) Select "Send Text to Word" on the "Tools" menu. The recognized text of the entire CAJ file is output to a Word file.

How to fix garbled characters when opening a Word document in WPS

Sometimes a Word document opens as a mass of garbled characters. Don't worry: the following methods may rescue the file.

1. Format conversion method

Save the damaged Word document in another format.

1. Open the damaged document and click "File" → "Save As". In the "Save as type" list, select "RTF Format", click "Save", and close Word.

2. Open the RTF file you just saved and use "Save As" again to save it as a "Word Document". Open the resulting Word file; in many cases the content is restored.

If the file still cannot be recovered after converting it to RTF, try saving it as plain text (*.txt) instead and then converting it back to Word format. Note that pictures and formatting are lost when converting to a txt file.
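When a document has been reduced to plain text as described above, the .txt file can itself look garbled if it is read with the wrong character encoding. That cause is an assumption on my part, not something the steps above state. The following Python sketch tries a short list of candidate encodings and reports which one decoded cleanly. The function name `decode_text` and the candidate list are illustrative choices, not part of any tool mentioned in this article.

```python
import codecs
from typing import Tuple

# Candidate encodings are an assumption: common choices for Chinese text.
CANDIDATES = ("utf-8", "gb18030", "big5")


def decode_text(raw: bytes) -> Tuple[str, str]:
    """Return (text, encoding_used) for bytes read from a .txt file."""
    # A byte-order mark settles the question before any guessing.
    if raw.startswith(codecs.BOM_UTF8):
        return raw.decode("utf-8-sig"), "utf-8-sig"
    if raw.startswith((codecs.BOM_UTF16_LE, codecs.BOM_UTF16_BE)):
        return raw.decode("utf-16"), "utf-16"
    # Otherwise take the first candidate that decodes without errors.
    for name in CANDIDATES:
        try:
            return raw.decode(name), name
        except UnicodeDecodeError:
            continue
    # Last resort: keep what is readable and mark the rest with U+FFFD.
    return raw.decode("utf-8", errors="replace"), "utf-8 (lossy)"


if __name__ == "__main__":
    # In-memory sample standing in for the bytes of a saved .txt file.
    sample = "乱码测试".encode("gb18030")
    text, encoding = decode_text(sample)
    print(encoding, text)
```

To use it on a real file, read the file in binary mode (`open(path, "rb").read()`) and pass the bytes in. The first encoding that decodes without errors is only a heuristic, so check the result by eye before pasting it back into Word.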

How to solve garbled characters when converting a PDF to a Word document

Some PDF files come out garbled when converted to Word documents. I tried many conversion programs, and the text was still garbled. To get around this, I used the following clumsy workaround:

1. Double-click to open the PDF file. You must download and install the PDF converter in advance.

2. Convert the Chinese text in the PDF to an editable Word document: in the open PDF file, click "File" → "Save As", select "TXT file (*.txt)" under "Save as type", select "Desktop" under "Save in", and click "Save". Open the txt document on the desktop (it has the same name as the PDF), select the text, and copy and paste it into the Word document.

3. Copy the pictures from the PDF into the Word document: in the open PDF file, click "Tools" → "Snapshot". If the picture is large, click the "Reduce" tool in the second toolbar row until the whole picture is visible. Select the picture by holding the left mouse button at its upper-left corner and dragging to the lower-right corner; a dotted box appears. Release the mouse button, then paste (Ctrl+V) at the desired position in the open Word document.

4. You can now edit the text in the Word document as needed. The pictures can only be formatted; their content cannot be edited.

Step 2 can also be done this way: in the open PDF file, click "Tools" → "Text Viewer" (the text in the PDF is then shown as plain text), right-click and choose "Select All" → "Copy", and paste it into Word. Although this works one page at a time, the result in the Word document stays closer to the original layout. Click "Tools" → "Text Viewer" again (or press Alt+9 repeatedly) to switch between the PDF reader view and the text view.
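Before pasting exported text into Word, it can help to check whether the export itself is garbled. The following Python sketch, offered as a rough heuristic rather than a definitive test, counts characters that rarely appear in readable text: the Unicode replacement character, private-use code points, unassigned code points, and control characters. The function name `garbled_ratio` is hypothetical, and what ratio counts as "too high" is a judgment call.

```python
import unicodedata

# Unicode categories treated as suspicious (an assumption of this sketch):
# Co = private use, Cn = unassigned, Cc = control characters.
SUSPICIOUS_CATEGORIES = {"Co", "Cn", "Cc"}


def garbled_ratio(text: str) -> float:
    """Return the share of non-whitespace characters that look garbled."""
    visible = [ch for ch in text if not ch.isspace()]
    if not visible:
        return 0.0
    suspicious = sum(
        1
        for ch in visible
        # U+FFFD is what decoders emit for bytes they could not interpret.
        if ch == "\ufffd" or unicodedata.category(ch) in SUSPICIOUS_CATEGORIES
    )
    return suspicious / len(visible)


if __name__ == "__main__":
    print(garbled_ratio("Readable text 可读文本"))
    print(garbled_ratio("\ufffd\ufffd\ue000 text"))
```

A ratio near zero suggests the exported text is usable. A noticeably higher ratio suggests the PDF's text layer is unreliable, in which case the OCR route described earlier may give better results.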

Steps for using an online PDF-to-Word converter:

Step 1: Upload the PDF file you want to convert. Once the page confirms that the upload succeeded, click to generate the Word document;

Step 2: Wait for the server to process the file;

Step 3: Download the Word document and save it on your computer.


source:docexcel.net