被汉王郁闷了,谁有成熟的ocr sdk介绍
公司有个开发需求,需要对上传的文档类图片进行文字识别,然后存放到数据库里;
Tesseract-ocr下载测试过, 不管是处理速度还是针对汉字的解析能力都不能达到需求,
网上都说汉王是成熟产品,我也查了下汉王的这类资料, 发现针对手机名片识别的还是比较成熟的,也不知道能不能达到我的要求,最离谱的是打了2天的咨询电话没人接, 今天有人接了还直接给我挂掉了,真叫个窝火,
各位大拿们有什么好介绍,要钱的无所谓,反正是方案提交,客户埋单,不过开源的最好.
[解决办法]
VERSION 5.00Begin VB.Form Form1 Caption = "VB实现OCR文字识别" ClientHeight = 3195 ClientLeft = 60 ClientTop = 345 ClientWidth = 4680 LinkTopic = "Form1" ScaleHeight = 3195 ScaleWidth = 4680 StartUpPosition = 3 '窗口缺省 Begin VB.CommandButton Command1 Caption = "识别" Height = 495 Left = 1800 TabIndex = 0 Top = 1320 Width = 1215 EndEndAttribute VB_Name = "Form1"Attribute VB_GlobalNameSpace = FalseAttribute VB_Creatable = FalseAttribute VB_PredeclaredId = TrueAttribute VB_Exposed = FalseOption ExplicitPrivate Sub Command1_Click()Dim strLayoutInfo As StringDim miDoc As ObjectDim modiLayout As Object '初始化并加载文档 Set miDoc = CreateObject("MODI.Document") '创建对象 miDoc.Create "z.tif" '加载图片文件 Screen.MousePointer = vbHourglass '设置光标忙 '识别 miDoc.Images(0).OCR miLANG_CHINESE_SIMPLIFIED, True, True '有用的就此一句,识别为中文简体 Set modiLayout = miDoc.Images(0).Layout '读出数据 strLayoutInfo = _ "Language: " & modiLayout.Language & vbCrLf & _ "Number of characters: " & modiLayout.NumChars & vbCrLf & _ "Number of fonts: " & modiLayout.NumFonts & vbCrLf & _ "Number of words: " & modiLayout.NumWords & vbCrLf & _ "Beginning of text: " & Left(modiLayout.Text, 50) & vbCrLf & _ "First word of text: " & modiLayout.Words(0).Text MsgBox strLayoutInfo, vbInformation + vbOKOnly, "Layout Information" Set modiLayout = Nothing Set miDoc = Nothing Screen.MousePointer = vbDefaultEnd Sub