Page 1 of 2

Scanning and recognition documents

Posted: 14 Oct 2012, 22:28
by PaulBy
Helo :)
This's kit for scanning and recognition of texts CuneiForm+YAGF+xSane+Poppler.
The module scans the text via xSane or receives it from the images, including imports in itself pdf-files (poppler), and identifies it with the use of CuneiForm. As a General graphical shell is used YAGF. All of the libraries and dependencies built into the module.
Link for downloads - http://yadi.sk/d/o21CTJFq1qLeR
Update 12.01.2013

Enjoy 8)

Re: Scanning and recognition documents

Posted: 15 Oct 2012, 06:18
by Hamza
You shouldn't merge all modules into one because you may have some troubles related to deps and also version of some deps. You can make a zip to make easier for user to update included package.

Re: Scanning and recognition documents

Posted: 15 Oct 2012, 14:22
by PaulBy
Hamza wrote:You shouldn't merge all modules into one because you may hav"e some troubles related to deps and also version of some deps. You can make a zip to make easier for user to update included package.
Package created for the specific application task work with the scanner and the text.
From my experience in Porteus, problems with dependencies to start exactly with the use of multiple packages to the same task. If you need to upgrade some components, the easier for me again rebuild the package, after checking his performance after the update, and only then offer it for use by other users.
The problem of performance of the program should be addressed by the author of the package, and not the users. :oops:

Re: Scanning and recognition documents

Posted: 16 Oct 2012, 20:55
by Hamza
So, how the users that uses your final module will be able to update a component ?

It was a simple suggestion :)

Re: Scanning and recognition documents

Posted: 16 Oct 2012, 22:24
by PaulBy
Updates or changes to the module makes the creator of the module, test it, and puts on the performance of the repository. If the user wants to use the new module, it will download and replace it in the folder / modules.
In my opinion, this is not a bad metod :oops:

Re: Scanning and recognition documents

Posted: 17 Oct 2012, 05:35
by Hamza
Up to you :)

Re: Scanning and recognition documents

Posted: 01 Jan 2013, 00:34
by jcsoh
Thanks for the module . I am not using it directly as I am using Slax 7 (actually Porteus xzm usually works with Slax 7 just by renaming the extension to sb).
I never heard of Cuneiform or Yagf before. Since I already have Xsane+Sane and Poppler , I just need Yagf ,Cuneiform and I seem to need Imagemagick as well.
I have now a working Yagf+Tesseract+Cuneiform bundle (yeah this is what Tomas name the new Slax 7 module with sb extension) .

Your module make it easy to know what is required , and solved the lack of gui for scanner in linux. Much aprreciated. :good:

Re: Scanning and recognition documents

Posted: 03 Jan 2013, 03:40
by francois
@jcsoh:
Really nice to see you on this forum. :D

xsane is a front-end gui for sane:
http://forum.porteus.org/viewtopic.php? ... imple+scan
There is also simple-scan, which is even simpler than xsane:
http://forum.porteus.org/viewtopic.php? ... imple+scan
However, scan recognition is something I did not knew.

Thanks PaulBy. :)

Re: Scanning and recognition documents

Posted: 03 Jan 2013, 13:23
by jcsoh
I saw your thread on simple scan and I think I tried to make a slax 6 or slax 7 , but failed some where along the line ?.
Perhaps I will try again.
Xsane is not really that complex if you ignored all the options and sticks to a few basic and just accept the default for most options.
Most of the time I actually use the scanner as a photocopier ie to scan a paper copy so as to print /make copies.

Re: Scanning and recognition documents

Posted: 03 Jan 2013, 17:56
by francois
Simple scan is a front end for sane. Thus you need sane too. Here is simple scan for porteus 32 bit:
http://www.mediafire.com/?fyqfrgdyc0rfeet

Re: Scanning and recognition documents

Posted: 10 Jan 2013, 01:27
by brokenman
Thanks. Very handy. I have added these packages to the 14.0 repo (not online as yet) for Porteus v.20

Re: Scanning and recognition documents

Posted: 10 Jan 2013, 19:11
by francois
Very good faith for this must have application.

Re: Scanning and recognition documents

Posted: 13 Jan 2013, 21:03
by PaulBy
Hello, frends :)
In view of the interest to my package, updated all included in it components to the actual version of the 12.01.2013 - http://yadi.sk/d/o21CTJFq1qLeR

Dear, Brokenman, this package includes deps for Porteus-2.0:
aspell-ru-0.99f7_1-noarch-4.xzm*
cuneiform--1.1.0--0.1.5.xzm*
libgphoto2-2.4.14-i486-2.xzm*
libieee1284-0.2.11-i486-3.xzm*
libnetsnmp30-5.7.2-0.0.pre1.1-mdv2012.0.i586.xzm*
libtiff4-3.9.2-1.xzm*
libv4l1-0-0.8.8-3.1.2.i586.xzm*
libv4l2-0-0.8.8-3.1.2.i586.xzm*
libv4lconvert0_0.8.8-3_i386.xzm*
poppler-0.20.2-i486-1.xzm*
poppler-data-0.4.5-noarch-1.xzm*
qt-4.7.0_7abde40-i486-3.xzm*
sane-1.0.22-i486-5.xzm*
xSane-0.998.xzm*
yagf_0.9.1-3_i386.xzm*

Enjoy 8)

Re: Scanning and recognition documents

Posted: 25 Oct 2013, 08:50
by Rava
PaulBy wrote:Hello, frends :)
In view of the interest to my package, updated all included in it components to the actual version of the 12.01.2013 - http://yadi.sk/d/o21CTJFq1qLeR

Dear, Brokenman, this package includes deps for Porteus-2.0:
aspell-ru-0.99f7_1-noarch-4.xzm*
Is aspell-ru needed fotr all users who never scan and OCR russian fonts?
PaulBy wrote: cuneiform--1.1.0--0.1.5.xzm*
Where can I get that stand alone module?

See here: http://forum.porteus.org/viewtopic.php? ... 850#p18777

I want it to be as minimal as possible, I even break up dependencies that are huge, like qt-4.7.0, to only have the libraries that my programs neds, like I did when creating a dependency module for fbreader:

Code: Select all

fbreader-0.99.2_deps_qt-4.8.4STRIPPED+liblinebreak-2.1+fribidi-0.19.2--x86_64.rava.xzm
I mark that part with STRIPPED, I do the same for programs with complicated menues and lots of localisation, and could shrink down my GIMP module quite some, see here: tip - shrinking down your own modules

My above dependency module for fbreader is approx 4.19 MB, while qt alone is > 24 M...

So, I want to build me a scanning module with the core elements, and a minimal dependency module like mentioned above, also creating a version just for me with both techniques, aka shrinking the module by removing unneeded locales.

And please, do comment in here http://forum.porteus.org/viewtopic.php?f=53&t=2235 as well...

Re: Scanning and recognition documents

Posted: 25 Oct 2013, 08:53
by Rava
Update:
Found the cuneiform--1.1.0--0.1.5.xzm:
http://code.google.com/p/fidoslax/downl ... -0.1.5.xzm
:)

//Update 2
I forgot to add... I need it as x86-64 variant, not the x86 one...