What does finereader do? How to recognize text using ABBYY FineReader: step-by-step instructions


    In order to use the program ABBYY FineReader which is designed for text recognition from non-editable and graphic formats. First you need to download it and install it on your computer, and then watch the video below, everything is described in detail about this program.

    This program is designed to scan text and work and recognize it.

    Of course, it can be used, and to carry out this use, you can, without leaving the Finereader program itself, in which you are working, recognize the text of the file and subsequently transform it from a scanned copy of the document into a classic format, Word programs. Then it will turn out to be for your use.

    Finereader is a program for scanning and text recognition with export of information to popular office packages. The principle of working with it can be described in a nutshell as follows: take a sheet of paper with printed text, scan it with a scanner, and get a certain graphic file raster format. Then, without leaving the Finereader program, we recognize the text of the file and the next step is to make a Word format document from the scanned copy. Before this, the recognized text can be viewed and edited. The resulting Word document can be further supplemented and edited.

    The Abbyyfinereader program is undoubtedly the leader among similar programs.

    It has very broad capabilities for recognizing text from non-editable and graphic formats.

    The program will be able to recognize text from such basic formats as (non-editable pdf, digital formats jpeg files, jpg, Djvu, gif, png, etc.).

    Also, ABBYY FineReader works well with almost all scanner models.

    The main functions of the program are:

    Scan documents to formats: Microsoft Word, Microsoft Excel, Pdf, scan and save images, PDF or image to Microsoft Word, convert photo to Microsoft Word.

    ABBYY Finereader work area:

    For adding new task, you must click on the **new task** button, which is located in the upper left part of the program work area.

    Will open window new task

    In the window that opens, you need to select the task you want to perform.

    Let's say we have a photo of a document that we want to convert into a Microsoft Word document format. To do this in the window new task find the active inscription Convert photo to Microsoft Word and click on this inscription. Will open program explorer window with preview :

    In the window that opens, select a photo text file which needs to be recognized and converted into the format you need.

    Will open window with recognition process scale:

    After the program processes the photo and tries to recognize the text.

    You will see the following:

    Here you can select the area of ​​your photo for text recognition.

    After selecting the area, click the button recognize which is located in top menu programs. The program will begin converting the selected photo into text. After processing the image, click on the arrow next to the button save and select the desired format to create a text document:

    Powerful and functional program ABBYY FineReader, intended for high-quality scanning and accurate recognition (this depends on the resolution set when scanning) of various paper media with printed text (books, magazines, newspapers, etc.), as well as digital images.

    The program supports various languages recognition, can save in: Microsoft Word, PDF, image formats and other formats. Since the program has intuitive interface, it is convenient to work with her.

    So, the first thing you need to do is set the settings and scan document, we get an image whose text follows the program recognize. After recognition, you can correct the text (if there are any inaccuracies) and save it in the desired format.

Translating text to digital format- a fairly common task for those who work with documents. The Abbyy Finereader program will help you save a lot of time by automatically translating inscriptions from raster images or “readers” into editable text.

In this article we will look at how to use Abbyy Finereader for text recognition.

How to recognize text from a picture using Abbyy Finereader

In order to recognize text on raster image, you just need to load it into the program, and Abbyy Finereader will automatically recognize the text. All you have to do is edit it, highlight what you need and save it in the required format or copy it into a text editor.

You can recognize text directly from a connected scanner.

Read more on our website.

How to create a PDF and FB2 document using Abbyy Finereader

Abbyy Finereader allows you to convert images into universal PDF format and FB2 format for reading on e-readers and tablets.

The process for creating such documents is similar.

1. In the main menu of the program, select the E-Book section and press FB2. Select the source document type—scan, document, or photo.

2. Find and open the required document. It will load into the program page by page (this may take some time).

3. When the recognition process is completed, the program will prompt you to select a format for saving. Select FB2. If necessary, go to “Options” and enter Additional information(author, title, keywords, description).

After saving, you can remain in text editing mode and convert it to Word format or PDF.

Features of text editing in Abbyy Finereader

There are several options for text that Abbyy Finereader recognizes.

In the original document, save the pictures and footers so that they are transferred to the new document.

Analyze the document to know what errors and problems may arise during the conversion process.

Edit the page image. Options for cropping, photo correction, and changing resolution are available.

So we told you how to use Abbyy Finereader. It has quite broad capabilities for editing and converting texts. Let this program help you create any documents you need.

One of the most popular functionality for working with scanning and file processing various types- Fine Reader. Functional software product was developed Russian company ABBYY, it allows you not only to recognize, but also to process documents (translate, change formats, etc.). Many users can only install it, but cannot immediately figure out how to use ABBYY FineReader. You can find answers to many questions in this article.

The program allows you to scan and recognize text - and more

To understand in detail what kind of program ABBYY FineReader 12 is, you need to consider in detail all its capabilities. The first and simplest function is to scan a document. There are two scanning options: with and without recognition. If you scan a printed sheet normally, you will get the image you scanned in specified folder on your computing device.

ATTENTION. The sheet must be placed evenly on the scanning part of the printer, along the contours indicated on the printer. Do not allow the source code to be twisted, this may lead to poor quality final scan.

You must decide for yourself why you need FineReader, since the utility has significant functionality, for example, you can independently choose what color you want the image to be in, it is possible to convert all photos to black and white. In black and white, recognition is faster and the quality of processing increases.

If you are interested in the text recognition function of ABBYY FineReader, before scanning you need to click special button. In this case, there are several options for obtaining information. As standard, a recognized piece of sheet will be displayed on your screen, which you can copy or edit manually.

If you select other functions, you can immediately receive the file as a Word document or Excel table. Selecting functions is very simple, the menu is intuitive and easy to customize due to the fact that all the buttons you need are in front of your eyes.

IMPORTANT. Before you recognize text ABBYY FineReader, you need to accurately select the processing language. Despite the fact that the utility works completely automatically, it happens that the low quality of the source does not allow us to understand what kind of language was in the source. This greatly reduces the quality of the final results of the application.

Multiple operating modes

To fully understand how to use ABBYY FineReader 12, you need to try two modes of operation: “Careful” and “Quick recognition”. The second mode is suitable for high-quality images, and the first for low-quality files. The Thorough mode takes 3-5 times longer to process files.

The illustration shows the result of the program - text recognition from an image

What other functions are there?

Text recognition in ABBYY FineReader is not the only one useful feature. For greater user convenience, there is

Hello. Today I will talk about how to use the Abbyy FineReader program to recognize text from an image that you may have received as a result of scanning. Your scanned text will be completely in a Microsoft Word document and this recognized text can be edited! Recognizing text using Abbyy Finereader can be useful for those who study, work with texts and translations. The program, unfortunately, is paid. I once had a chance to try one of free options similar programs, but very well scanned text is recognized simply terribly... And text recognition in Abbyy FineReader turns out to be very high quality! Now I will show you how to use the Abbyy FineReader program to quickly recognize text from an image.

ABBYY FineReader has a trial version for 30 days with the ability to recognize up to 100 pages and save no more than 3 pages from a document. Those. During this time, you can see the capabilities of the program and make an informed decision - whether you need it, whether it’s worth buying or not.

How to install Abbyy FineReader!

Before using Abbyy Finereader you need to install it. Let's look at the installation process of this program...

First, select the program language. Click "OK".

We accept the terms license agreement(If you wish, you can read the license agreement if you are interested in what it is about). Click “Next”.

Next, you must select the installation mode. At normal mode the program will not ask you and will install what is specified in the program by default, namely all components: the Abbyy Finereader text recognition program itself, a component for Microsoft Office programs and a component for Windows Explorer (which allows you to quickly recognize images without opening the program separately) . I advise you to check custom installation to configure it the way you need. Moreover, it won’t take even 15 minutes :) Below is the folder where the program will be installed. It is advisable to leave the default selection so that there are no problems later when using the program. Click “Next”.

Program components. This window will appear if you select the “Custom” installation type. Components are something like auxiliary applications for a program. The first component “Integration with Microsoft programs Office and Windows Explorer" This component will be displayed in the Microsoft Office menu and if you click on the image on your computer right click mouse, then there will be an item with this program. This is what your menu will look like in Microsoft Office after adding this component.

Here's what happens if you right-click on the image:

Those. A menu will appear in which you can do quick text recognition and send the results to Word, Excel or PDF.

The second component will allow you to recognize text from your computer screen. This means that you can take a screenshot and also recognize the text. If you do not want to install one of these components, or do not want to install both, then you need to click on the down arrow and select “This component will not be available.” Then the component will not be installed. I left both.

Next 4 points. The first means that information about how you use the Abbyy Finereader program will be transferred to the developer. I advise you not to check this item so that the program does not once again go online to send information about working with it. Moreover, you never know what other information will be sent :) The 2nd point creates a shortcut to the program on the desktop. The 3rd means that the program will start when the computer is turned on, and the 4th will check for program updates. I leave only the second one and leave a tick next to it. Closing everything Microsoft applications Office, because the installer requires it and click “Install”.

You need to wait a couple of minutes for the program to load and click “Next”.

That's it, installation is complete! Click “Finish”.

How can I use Abbyy Finereader to recognize text from a scanned or any other image?

Let's look at how to use the program. For example, you have scanned text. Now, to recognize text in Abbyy FineReader, open the program. Click “Open”.

Select the image we need and click open.

When you open required document, Abbyy Finereader will begin to recognize the text. The larger the document, the longer recognition will take. Recognition of one page may take several seconds.

After the text is recognized, all you have to do is save the result in Microsoft document Word so you can then edit anything in it. To do this, click the “Save” button on top panel tools, then select in which folder it will be saved. Word document and under what name.

If you have a scanner connected to your computer, then you can start scanning directly from the program, and after which the scanned document will be immediately recognized. To do this, click the “Scan” button on the top toolbar. Next steps will depend on the driver program for your printer. You only need to follow the instructions of the scanning wizard.

As you can see, everything is very simple and fast. Now you know how to use Abbyy FineReader to recognize text from images! I hope this information will help a lot of people :) Good luck!

The history of Abbyy FineReader goes back more than 20 years. The company celebrated the anniversary of 2013 with the release of a full-fledged (compared to Express Edition from 2009) Abbyy FineReader Pro for Mac, and a couple of months later, in February 2014, they also received their “gift” Windows users- Abbyy FineReader 12 Professional and Corporate. Let me remind you that the previous version appeared back in 2011, and two and a half years is a long time - let's figure out how significant the changes are.

general information

System requirements for new version have not changed at all. The platform can be Windows or Windows Server starting from XP and 2003 respectively. Hardware requirements are even more modest these days: a processor of any capacity with a frequency of 1 GHz or more, random access memory at least 1 GB plus 512 MB for each computing core, etc. Only the need for disk space- now installation requires not 700, but 850 MB (plus, as before, another 700 MB for working files).

Naturally, we're talking about O minimum requirements; the full capabilities of Abbyy FineReader 12 Professional will be revealed only at relatively modern systems. In particular, let me remind you that the program can effectively parallelize processing individual pages, uses all processor cores and loads any processor almost 100%. But it’s really not greedy when it comes to RAM, and even remains 32-bit.

The installation procedure has not changed either: a minimum of questions and options. Abbyy FineReader 12 Professional still comes with Abbyy Screenshot Reader, which becomes operational only after user registration.

After this, you will also have access to technical support.

Even on the basis of this modest information, we can assume that this is the result of evolution. Accordingly, in what follows I will focus on describing the changes compared to previous version, which can be divided into two main groups: working with the program (interface, auxiliary tools, ease of use) and OCR (quality and performance of the recognition itself).

Working with the program

Abbyy FineReader 12 Professional demonstrates some improvements in the user interface. This is immediately noticeable in the Tasks window, which opens by default when the program starts. It obviously imitates the concept Windows tiles 8.x and is adapted for finger control, especially since the program also supports basic gestures like scrolling and zooming. In fact, the changes affected only the “facade”, and only partly - next to the tiles there are regular controls and in the process of setting up any scenario you will have to deal with standard ones dialog boxes. Working with them with your fingers is quite problematic, especially on 8-10″ screens, which are becoming popular with Windows tablets.

It’s really not difficult to imagine that the user of such a tablet equipped with a camera might want to quickly enter some printed document “on the go.” Meanwhile, all Windows history, starting with the first edition of Tablet PC, confirms the pointlessness of adapting a standard desktop interface to touch controls. Apparently, for these purposes it is much more correct to create a special shell that corresponds to all Metro canons, but uses the same “engine”. Example such a decision serves Internet Explorer from Windows 8.x. In addition, Abbyy even has a certain backlog in the form of Abbyy FineReader Touch for Windows 8, which uses cloud service companies.

If we take our minds off touch input, then there will be more changes in this class - from the quite expected update of windows for opening/saving documents, which, among other things, provide easy access to cloud storage(if there is a corresponding agent and its folder in the system), to several more important and useful ones.

Page processing in Abbyy FineReader 12 Professional is now done in the background. This implies the absence of the former modal window with the status of operations (now this role is played by the status line at the bottom of the screen) and, accordingly, the availability of access to the interface. Thus, the user has the opportunity to work with the program in parallel with the recognition process (if it is, of course, long enough), for example, copy fragments of the received text or even adjust the page layout - the latter will be queued and processed again.

Unlike previous version, also there is no turning of pages during recognition or when bootstrap document, if automatic recognition disabled. In Abbyy FineReader 12 Professional, the document is loaded and divided into pages almost instantly, and their thumbnails are built only as you manually scroll through the left panel. Among other things, this saves computing resources, quite noticeably on large multi-page documents.

The remaining changes in this class are not so interesting, although they may be useful in some scenarios, so we will talk about them briefly.

If you do not need to process the entire document, but only quote individual passages, then you can disable all automatic operations and select the necessary fragments of any type, immediately copying them to the clipboard - while analysis and recognition will be performed on the fly.

To get a result with a simpler structure than the original, you can disable the recreation of headers, footers, and other layout elements. This can be useful, for example, when preparing e-books.

Continuing about e-books - Abbyy FineReader 12 Professional supports EPUB formats 2.0.1 and 3.0.

The conversion options to XLSX have been expanded, for example, it is now possible to clear formatting or save images.

When saving resulting documents to PDF with a text layer, you can now use new technology Abbyy Precise Scan, which consists of smoothing characters on original page images. By the way, it is available only in color mode.

The effect of her work is quite noticeable, although not always, let’s say, “academic.” However, the readability of antialiased characters should be higher in any case, and in in this example The original is really very low quality.


OCR

Now let's see what improvements have occurred in the recognition mechanisms themselves.

The developers report the next stage in improving ADRT technology, which, let me remind you, analyzes and recreates the logical structure of the document. It is declared that it has begun to work much more accurately, especially with tables, lists, and diagrams. Demonstrating this with adequate examples is not so easy, but not impossible. Here, for example, are the recognition results (with default settings) of the same page in Abbyy FineReader 11 Professional (above) and Abbyy FineReader 12 Professional (below).


The old version selected and processed only the main text block, perhaps considering the remaining elements as “garbage” due to the low quality of the original. The new one, on the contrary, correctly identified the list and tried to recreate it. The result, however, is not ideal: the fact that not all markers were recognized can, again, be attributed to the quality of the image, but the program, apparently, still did not understand that there was content in front of it, otherwise it would not have interpreted the numbers as letters. However, progress is obvious and such claims might not have been made with higher quality originals.

And here's how an "implicit" table is processed without dividing lines- Abbyy FineReader 11 Professional (top) and Abbyy FineReader 12 Professional (bottom).


It is clearly visible that the old version, unlike the new one, did not see a table structure here at all and was limited to a set of unrelated text blocks. Take the time to click on the images and compare the recognition results - Abbyy FineReader 12 Professional is close to ideal.

Unfortunately, this does not always happen, and already on the neighboring pages Abbyy FineReader 12 Professional showed results similar to Abbyy FineReader 11 Professional. Although it would be ADRT who should have tracked the identical “caps” and understood that in front of it was a kind of flowing table.

But it is still clearly noticeable that the updated algorithms pay attention to more details than before. During testing of Abbyy FineReader 12 Professional, for example, there was even an attempt to interpret a picture with an ordered placement on it as a table text information. Much more often, the new version also tries to recreate various diagrams and diagrams based on background picture, and not from separate graphic and text blocks.

There are several other new features designed to improve the quality of recognition in Abbyy FineReader 12 Professional. As you know, one of the prerequisites for this is the quality of the original, especially if it was obtained using a camera rather than a scanner. That is why, at one time, FineReader included tools pre-treatment originals. In the new version, their list has been expanded, cropping along the edges of pages, lightening and leveling the background brightness, and removing colored elements have been added. The latter can be useful, for example, for processing documents with seals and stamps. In addition, the user can now connect various methods individually.

Language support has also been improved. Firstly, a Russian alphabet with accents has appeared, and secondly, an increase in the quality of recognition of Chinese, Japanese and Korean (up to 20%), Arabic (up to 60%), and Hebrew (up to 10%) is declared - this has apparently been achieved through improvement and additional training of classifiers.

And finally, one of the most burning questions for many readers: has the speed of the program increased? It is not so easy to answer this question reasonably, especially with numbers - there are too many languages, each of which has its own nuances; the variety of originals is too great; There are too many unknown factors influencing the operation of algorithms. Therefore, even the developers themselves are quite restrained when talking about an increase in the performance of Abbyy FineReader 12 Professional by 10-15%.

Such figures are usually obtained from the results of processing fairly large amounts of documents and, accordingly, represent something like the “average temperature in the hospital.” Therefore, it is useful to study in more detail some illustrative special cases, for example, like the following two:

  • scanned in color with a resolution of 300 dpi 10 pages of a full-color booklet in A4 format. The quality is good, languages ​​are Russian and English, the layout is complex;
  • PDF with graphic images 138 pages of the book containing a small number of color and black and white illustrations, several tables. The quality is low (starting, apparently, with the “blind” printing in the paper book), the languages ​​are Ukrainian and Russian, the layout is simple.

Both documents were recognized in color mode, and the second one was also recognized in black and white, which was intended to simulate the preparation process e-book. All default settings were left unchanged, with the exception of the set of languages ​​and, accordingly, operating modes. A PC with an i5-3450 processor and 8 GB of memory was used as a testing ground. The results are presented in the following table:

As you can see, for PDF the speedup even exceeds the promised 15% - perhaps this is just one of special occasions, well suited for latest optimizations in recognition algorithms. It should be borne in mind that programs, generally speaking, have done different amounts of work. Just look at the illustrations above for table processing - it’s hard to say which version was more difficult.

As for the number of errors, it was practically the same for both versions, although it was noticeable that sometimes different fragments and symbols raise doubts - this, apparently, is evidence of the training of the algorithms. In any case, the majority of uncertainly recognized characters were absolutely correctly identified using dictionaries, and “gross” errors (incorrect interpretation of special and decorative symbols, text on graphics, etc.) coincided. So the difference can be considered completely disappearing.

Another question is, how much does such productivity improvement matter? Apparently, the gain of half a minute on 138 pages that still need to be checked and possibly corrected is not worth much. If work like test tasks is supposed to be performed occasionally, then you definitely don’t have to worry about performance. It's a different matter when it comes to offline processing. large volumes documents, which is available in Abbyy FineReader 12 Corporate. In this case, saving 15% of time is already quite noticeable.

Summary

Despite the fact that the new Abbyy FineReader 12 Professional did not promise anything revolutionary, at least the few changes it makes are commendable. First of all, these are improvements to ADRT technology in terms of recognizing tables, diagrams and the logical structure of pages in general, which in some cases allows you to get radically top scores, and background mode processing, which opens up new opportunities for interactive work with large documents.

There are also many other changes, although they are less significant. Movement towards support touch control today it is certainly justified, but the path chosen is a vicious one - to provide the same in one interface comfortable work It's hardly possible with a mouse and fingers. However, for now, Windows tablets are just trying to break into the market, and the developers from Abbyy still have time.

Prices for Abbyy FineReader 12 Professional:

  • boxed version: 4990 RUR;
  • download version: RUB 4,490;
  • update: 2690 rub.

As usual, the answer to the question “is it worth changing old version to a new one? depends on the situation. In any case, it is worth considering that life cycle FineReader is quite long-lasting, and if any of the described improvements plays any significant role for you, then in 2-3 years the cost of updating will certainly pay off - if not financially, then morally. Solving this question for yourself will finally help.







2024 gtavrl.ru.