Discussion:
[Wikimediaindia-l] Wikisource OCR tool
Tito Dutta
2018-11-26 15:26:45 UTC
Hello,
We just completed the Indic Wikisource consultation 2018, and one of the
most important parts of the consultation was the Wikisource tech needs
assessment. We have received a few tool and script suggestions that Indic
TechCom may work on.

Meanwhile, please have a look at:
https://meta.wikimedia.org/wiki/Indic-TechCom/Tools/IndicOCR or the tool
link directly https://tools.wmflabs.org/jayprakashbot/
This is a web OCR tool. Google OCR currently does not support the following
languages, and this tool will help there:

1. Malayalam Wikisource
2. Telugu Wikisource
3. Odia Wikisource
4. Gujarati Wikisource
5. Kannada Wikisource
6. Punjabi Wikisource


I'll keep the list informed about further development.


Thanks
Tito Dutta
Note: If I don't reply to your email in 2 days, please feel free to remind
me over email or phone call.
Jay prakash
2018-11-26 16:01:07 UTC
Just a correction: https://tools.wmflabs.org/jayprakashbot/ is my personal
experimental tool. The correct URL is https://tools.wmflabs.org/indic-ocr/

Thank you :)
_______________________________________________
Wikimediaindia-l mailing list
To unsubscribe from the list / change mailing preferences visit
https://lists.wikimedia.org/mailman/listinfo/wikimediaindia-l
Shrinivasan T
2018-11-27 14:53:11 UTC
Great work.

Thanks JP for the nice work.

Please share the link to the source on this page:
https://tools.wmflabs.org/indic-ocr/

Shrini
Satdeep Gill
2018-11-27 14:54:59 UTC
Tito, thank you for sharing the link here. This tool is really helpful.
Barnstars to Jay Prakash.
Jay prakash
2018-11-27 15:34:55 UTC
@Shrinivasan T, you can find the source on GitHub:

https://github.com/wikimedia/labs-tools-indic-ocr
Suyash Dwivedi
2018-11-28 04:41:21 UTC
Good going, Jay.

Regards,
Suyash Dwivedi
(U:Suyash.dwivedi)
sent from mobile, please consider any typos
Sudhanwa Jogalekar
2018-11-28 16:14:00 UTC
Many thanks Jay for your work. Will check out the code.

What are Google's terms and conditions for using their OCR services? A
link to those would also help.

Regards
Sudhanwa
Satpal Dandiwal
2018-11-28 16:20:08 UTC
Thanks, Jay! You did a really great job.

Regards
~ Satpal Dandiwal
Dr. Manavpreet Kaur
2018-11-29 15:36:37 UTC
Hi Jay,

You have done an excellent job. Kudos to you and your team (the ones
involved in planning, guidance and of course motivation).

Regards,
Manav