Open Source OCR Tesseract installation on Ubuntu and use of it

Leave a comment

First of all you must have command line expertise to use this open source OCR software

At the beginning we are going to install Tesseract on Ubuntu

Open your terminal and write the following command

root@nur-HP:~#apt-get install tesseract-ocr

It will install OCR on your Ubuntu Operating System. Then install your desire language packages. Remember you do not to install English language package because it already installed with tesseract installation.

Here, I going to install Bangla language package

apt-get install tesseract-ocr-[lang]

root@nur-HP:~#apt-get install tesseract-ocr-ben    (This command will install Bangla language package)

If you like to install All language packages, try the following command

root@nur-HP:~#apt-get install tesseract-ocr-all

Our installation has completed. Now we are going to use it

tesseract [image_path] [file_name]

sample command:

root@nur-HP:~#tesseract /home/nurahammad/Dropbox/ForOCR/IMG_20171201_161244.jpg /home/nurahammad/Desktop/test

If you like to see the result on terminal, try below command

tesseract [image_path] stdout

root@nur-HP:~# tesseract /home/nurahammad/Dropbox/ForOCR/IMG_20171201_161244.jpg stdout

I think it will help you for processing your Repository/Digital Library files



Tomcat Manually install in Debian and Ubuntu (Without apt-get method)

Leave a comment

First download apache tomcat


Decompress the downloaded apache tomcat:

tar zxvf apache-tomcat-7.0.76.tar.gz

Now move the decompressed file in the following location:

mv apache-tomcat-7.0.76 /opt/tomcat

Install java

apt-get install default-jdk

Set the environment variables in .bashrc:

nano ~/.bashrc

Add the following lines to the end of the file:

export JAVA_HOME=/usr/lib/jvm/default-java
export CATALINA_HOME=/opt/tomcat

Simply save and exit .bashrc, then make the changes effective by running the following command:

. ~/.bashrc

To start Tomcat, run the following script:


You should get a result similar to:

Using CATALINA_BASE: /opt/tomcat
Using CATALINA_HOME: /opt/tomcat
Using CATALINA_TMPDIR: /opt/tomcat/temp
Using JRE_HOME: /usr/lib/jvm/default-java
Using CLASSPATH: /opt/tomcat/bin/bootstrap.jar:/opt/tomcat/bin/tomcat-juli.jar
Tomcat started.

Test through web browser by the address:


Cron Jobs setup for DSpace production system

1 Comment

Add crob jobs with the following lines in crob jobs
root@dspace#crontab -e

@hourly /home/dspace/bin/dspace itemcounter

@hourly /home/dspace/bin/dspace stats-util -f
@hourly /home/dspace/bin/dspace stats-util -i
@daily /home/dspace/bin/dspace stats-util -u
@daily /home/dspace/bin/dspace stats-util -m

@daily /home/dspace/bin/dspace update-discovery-index -o
@weekly /home/dspace/bin/dspace update-discovery-index -b -f
@monthly /home/dspace/bin/dspace update-discovery-index -c

@weekly /home/dspace/bin/dspace checker -u -d2h -p
@weekly /home/dspace/bin/dspace checker-emailer -a
@monthly /home/dspace/bin/dspace cleanup

10 0 * * * /home/dspace/bin/dspace embargo-lifter

0 1 * * * /home/dspace/bin/dspace sub-daily

10 1 * * * /home/dspace/bin/dspace stat-general
20 1 * * * /home/dspace/bin/dspace stat-monthly

10 2 * * * /home/dspace/bin/dspace stat-report-general
20 2 * * * /home/dspace/bin/dspace stat-report-monthly

@weekly /home/dspace/bin/dspace filter-media
0 3 * * * /home/dspace/bin/dspace index-update
0 6 * * * /home/dspace/bin/dspace generate-sitemaps

Providing First World Library services By using Koha, DSpace, vufind and Drupal

Leave a comment

Providing First World Library services By using Koha, DSpace, vufind and Drupal

Library Automation and Digitization at Chittagong Veterinary and Animal Sciences University 6 December 2012 Providing First World Library services By using Koha, DSpace, vufind and Drupal Presented By Nur Ahammad Consultant of the project- Modernization of Central Library of CVASU

Install DSpace on Debian Squizee

Leave a comment

There are some prerequisite software for DSpace installation on Debian Squizee


Leave a comment

Project Gutenberg offers free books in many formats. Do a search using the word “children” and there’s a list of categories with many different types of children’s books. The link for this search is

Google also offers free ebooks in many formats. Go to Google Books Type in a term, title, or author. One example of a phrase I tried is “children’s stories”. You can search for the type of books you’re interested in using subject terms. Click on the Search button. After you get results, click on Free Google ebooks, which is on the left side of the web page. Your results are for children’s books that are free and can be downloaded from Google.

For U.S. books, please note that many free books are older books which are no longer subject to U.S. copyright law.

 Mrs. P and her free interactive magic library at Mrs. P is played by TV Star Kathy Kinney who played Mimi on The Drew Carey Show.

From this site is not possible to download, but it’s got nice simple search for kids and you can read online:

Syn Item and biblio table

Leave a comment

As you’ve seen, items info are stored in two locations: ‘items’ table
and with the biblio records. If you load data the way you’ve done it,
you have to ‘copy’ data from items table to biblio records. There is
script that can do it automatically for you:


The other option is to create directly items records during biblios
import. You have to put your item info into 952 tag for MARC21 records
(995 for UNIMARC).

Older Entries

%d bloggers like this: