文檔金喜正規買球>>LEADTOOLS使用教程>>LEADTOOLS入門教程：Leadtools .NET OCR用法

LEADTOOLS入門教程：Leadtools .NET OCR用法

LEADTOOLS OCR功能提供了將光學字符識別（OCR）技術融合到應用程序中的方法。OCR可將位圖圖像轉換為文本。

一旦在系統中安裝LEADTOOLS .NET OCR工具包，用戶便可以在程序中使用LEADTOOLS OCR。需要注意的是，在用戶使用OCR屬性，方法和事件之前，必須對OCR功能解鎖。

用戶可以添加引用到Leadtools.Forms.Ocr.dll和 Leadtools.Forms.DocumentWriter.dll組件從而啟動LEADTOOLS for .NET OCR。這些組件包含了各種接口、類、結構和委托。

由于LEADTOOLS OCR工具包支持多個引擎，一旦創建了IOcrEngine接口實例，與引擎接口的實際代碼便被存儲在一個被動態加載的單獨程序集中。因此，你必須確保即將使用的引擎程序集位于旁邊的Leadtools.Forms.Ocr.dll組件。如果你需要自動檢測依賴關系，你可以將引擎程序集作為引用添加到程序中。

LEADTOOLS提供了實現下列功能的方法：

從各種文字、文字處理、數據庫或者電子表格文檔中識別和導出文本；
在單線程或者多線程環境下執行OCR處理；
選擇需要識別的文檔語言，如英語，丹麥語，荷蘭語，芬蘭語，法語，德語，意大利語，挪威語，葡萄牙語，俄語，西班牙語或瑞典語；
自動或手動將復雜頁面劃分為文本區，圖像區，表格區，線，頁眉和頁腳；
識別前，設置精度閾值以控制識別精度；
自動檢測傳真，點陣和其他degraded文檔；
支持多種文檔保存格式，如Adobe PDF、 PDF/A, MS Word, MS Excel和UNICODE文本等等。
處理文本和圖形。

LEADTOOLS通過OCR手柄與OCR引擎和包含的頁面列表的OCR文檔進行交互。OCR手柄是安裝在系統上的LEADTOOLS OCR和OCR引擎之間的通信會話。OCR手柄是一種內部結構，包含了識別、獲取信息、設置信息和文本驗證的所有必要信息。

識別單頁或多頁的步驟如下：

1、選擇所需引擎類型并創建IOcrEngine接口實例；

2、利用 IOcrEngine.Startup方法啟動OCR引擎；

3、創建單頁或多頁OCR文檔；

4、手動或自動創建頁面區域；

5、設置OCR引擎所需的活動語言；

6、設置拼寫檢查語言；

7、識別；

8、保存識別結果；

9、關閉OCR引擎。

步驟4，5，6和7可以不必依照順序進行，只要在OCR引擎啟動后和頁面識別之間執行這幾個步驟即可。

下面的示例展示了如何執行上述步驟：

Visual Basic

' Assuming you added "Imports Leadtools.Forms.Ocr" and "Imports Leadtools.Forms.DocumentWriter" at the beginning of this class
' *** Step 1: Select the engine type and create an instance of the IOcrEngine interface.
' We will use the LEADTOOLS OCR Plus engine and use it in the same process
Dim ocrEngine As IOcrEngine = OcrEngineManager.CreateEngine(OcrEngineType.Plus, False)

' *** Step 2: Startup the engine.

' Use the default parameters
ocrEngine.Startup(Nothing, Nothing, Nothing, "C:\LEADTOOLS 18\Bin\Common\OcrAdvantageRuntime")

' *** Step 3: Create an OCR document with one or more pages.

Dim ocrDocument As IOcrDocument = ocrEngine.DocumentManager.CreateDocument()

' Add all the pages of a multi-page TIF image to the document
ocrDocument.Pages.AddPages("C:\Users\Public\Documents\LEADTOOLS Images\Ocr.tif", 1, -1, Nothing)

' *** Step 4: Establish zones on the page(s), either manually or automatically

' Automatic zoning
ocrDocument.Pages.AutoZone(Nothing)

' *** Step 5: (Optional) Set the active languages to be used by the OCR engine

' Enable English and German languages
ocrEngine.LanguageManager.EnableLanguages(New String() {"en", "de"})

' *** Step 6: (Optional) Set the spell checking language
' Enable the spell checking system and set English as the spell language
ocrEngine.SpellCheckManager.SpellCheckEngine = OcrSpellCheckEngine.Native
ocrEngine.SpellCheckManager.SpellLanguage = "en"

' *** Step 7: (Optional) Set any special recognition module options

' Change the fill method for the first zone in the first page to be Omr
Dim ocrZone As OcrZone = ocrDocument.Pages(0).Zones(0)
ocrZone.FillMethod = OcrZoneFillMethod.Omr
ocrDocument.Pages(0).Zones(0) = ocrZone

' *** Step 8: Recognize

ocrDocument.Pages.Recognize(Nothing)

' *** Step 9: Save recognition results

' Save the results to a PDF file
ocrDocument.Save("C:\Users\Public\Documents\LEADTOOLS Images\Document.pdf", DocumentFormat.Pdf, Nothing)
ocrDocument.Dispose()

' *** Step 10: Shut down the OCR engine when finished
ocrEngine.Shutdown()
ocrEngine.Dispose()

// Assuming you added "using Leadtools.Codecs;", "using Leadtools.Forms.Ocr;" and "using Leadtools.Forms.DocumentWriters;" at the beginning of this class
// *** Step 1: Select the engine type and create an instance of the IOcrEngine interface.

// We will use the LEADTOOLS OCR Plus engine and use it in the same process
IOcrEngine ocrEngine = OcrEngineManager.CreateEngine(OcrEngineType.Advantage, false);

// *** Step 2: Startup the engine.

// Use the default parameters
ocrEngine.Startup(null, null, null, @"C:\LEADTOOLS 18\Bin\Common\OcrAdvantageRuntime");

// *** Step 3: Create an OCR document with one or more pages.

IOcrDocument ocrDocument = ocrEngine.DocumentManager.CreateDocument();

// Add all the pages of a multi-page TIF image to the document
ocrDocument.Pages.AddPages(@"C:\Users\Public\Documents\LEADTOOLS Images\Ocr.tif", 1, -1, null);

// *** Step 4: Establish zones on the page(s), either manually or automatically

// Automatic zoning
ocrDocument.Pages.AutoZone(null);

// *** Step 5: (Optional) Set the active languages to be used by the OCR engine

// Enable English and German languages
ocrEngine.LanguageManager.EnableLanguages(new string[] { "en", "de" });

// *** Step 6: (Optional) Set the spell checking language

// Enable the spell checking system and set English as the spell language
ocrEngine.SpellCheckManager.SpellCheckEngine = OcrSpellCheckEngine.Native;
ocrEngine.SpellCheckManager.SpellLanguage = "en";

// *** Step 7: (Optional) Set any special recognition module options

// Change the fill method for the first zone in the first page to be default
OcrZone ocrZone = ocrDocument.Pages[0].Zones[0];
ocrZone.FillMethod = OcrZoneFillMethod.Default;
ocrDocument.Pages[0].Zones[0] = ocrZone;

// *** Step 8: Recognize

ocrDocument.Pages.Recognize(null);

// *** Step 9: Save recognition results

// Save the results to a PDF file
ocrDocument.Save(@"C:\Users\Public\Documents\LEADTOOLS Images\Document.pdf", DocumentFormat.Pdf, null);
ocrDocument.Dispose();

// *** Step 10: Shut down the OCR engine when finished
ocrEngine.Shutdown();
ocrEngine.Dispose();

欧美日韩亚-欧美日韩亚州在线-欧美日韩亚洲-欧美日韩亚洲第一区-欧美日韩亚洲二区在线-欧美日韩亚洲高清精品

金喜正规买球

LEADTOOLS入門教程：Leadtools .NET OCR用法

用科技創就卓越

Create excellence with technology