We are working on a comprehensive overview of text and media databases for Islamicate Studies, of the latest tools that make working in the Digital Humanities easier, and of tutorials on how to use them.
Your suggestions are very welcome; just drop us an e-mail!
Books on Natural Language Processing
- Habash, Nizar Y. Introduction to Arabic Natural Language Processing. San Rafael, Calif: Morgan and Claypool Publishers, 2010.
- Lachkar, Abdelmonaime, Karim Bouzoubaa, Azzedine Mazroui, Abdelfettah Hamdani, and Abdelhak Lekhouaja. Arabic Language Processing: From Theory to Practice. 6th International Conference, ICALP 2017, Fez, Morocco, October 11–12, 2017, Proceedings. Cham: Springer International Publishing, 2018.
- Shaalan, Khaled, Aboul Ella Hassanien, and Fahmy Tolba. Intelligent Natural Language Processing: Trends and Applications. Cham: Springer International Publishing, 2018.
- Zitouni, Imed, Editor. Natural Language Processing of Semitic Languages. Theory and Applications of Natural Language Processing Edited Volumes. Springer, 2014.
Online Resources and Tools
- CAMeL tools: Suite of Arabic natural language processing tools – http://github.com/CAMeL-Lab/camel_tools
- Farasa: State-of-the-art full-stack package to deal with Arabic Language Processing – http://farasa.qcri.org
- MADAMIRA: Morphological analyzer and disambiguator for Arabic – http://camel.abudhabi.nyu.edu/madamira
- Text-fabric: A corpus of ancient texts and linguistic annotations – http://annotation.github.io/text-fabric/tf/index.html
- Open Islamicate Texts Initiative (OpenITI): A multi-institutional effort to construct the first machine-actionable scholarly corpus of premodern Islamicate texts – https://maximromanov.github.io/OpenITI/ – http://github.com/OpenITI/RELEASE
- Tashkeela: Arabic diacritization corpus – http://sourceforge.net/projects/tashkeela
- Quranic Arabic Corpus: A linguistically annotated resource corpus for the Holy Quran – http://corpus.quran.com
- Arabic Papyrology Database: Editions of pre-modern Arabic documents – http://www.naher-osten.lmu.de/apd
- Arabic Papyrology School: Introduction to methods that enable you to read and understand original Arabic documents – http://www.naher-osten.lmu.de/aps
- Archive of Arabic Talk Shows – http://www.naher-osten.lmu.de/aats
- Prosopography of the ulemas of Al-Andalus – https://www.eea.csic.es/pua
- Bio-bibliographic resource about 5000 Andalusi ulemas and their works – https://www.eea.csic.es/red/hata
- KITAB: Project on text-reuse in Arabic texts – http://kitab-project.org
- The Syriac Electronic Data Research Archive (SEDRA): A linguistic and literary database of the Syriac language and literature – http://sedra.bethmardutho.org
- Simtho: The Syriac thesaurus is a medium-size database of Syriac literary texts – http://bethmardutho.org/simtho
- AraVec: Word embedding models for Arabic NLP – http://github.com/bakrianoo/aravec
- Jedli: Toolbox that contains several distinct search functions for Arabic texts – http://www.islamic-empire.uni-hamburg.de/publications-tools/digital-tools/jedli.html
- Mamluk database: Comprehensive bibliographies of all primary sources relating to the Mamluk sultanate of Egypt and Syria – http://mamluk.lib.uchicago.edu
- Fihrist: Union Catalogue of Manuscripts from the Islamicate World – http://www.fihrist.org.uk
- Shamila: Arabic Digital Library – http://shamela.ws
- al-Raqmiyyāt: Personal webpage and project pages of Maxim Romanov – http://alraqmiyyat.github.io
- Arabic and Middle Eastern Electronic Library (AMEEL): Web-based portal and a digital collection of information for the study of the Middle East – http://findit.library.yale.edu/catalog/digcoll:2835720
- Bibliotheca Arabica: Project dedicated to research on Arabic literatures dating from the years 1150 to 1850 CE – http://www.saw-leipzig.de/de/projekte/bibliotheca-arabica/intro
- Contemporary Bioethics and the History of the Unborn in Islam (COBHUNI): Project on prenatal life in Islamic history that contains a corpus on Islamic bioethics and digital tools for Arabic – http://www.cobhuni.uni-hamburg.de
- The intertwined world of the oral and written transmission of sacred traditions in the Middle East (InterSaME): Project on the transmission of sacred texts among Islam, Eastern Christianity and Judaism that offers novel approaches to transcribing and annotating early Quranic manuscripts – http://www.intersame.uni-hamburg.de
- Database of Ottoman Inscriptions (DOI): Searchable digital database comprising information about, as well as transliterations and pictures of, all the Turkish, Arabic and Persian architectural inscriptions created in the Ottoman lands during Ottoman times – http://info.ottomaninscriptions.com
- Digital Hammurabi: Podcast, Blog and Literature on Assyriology and the Hebrew Bible – http://www.digitalhammurabi.com
- Digital Humanities Institute Beirut (DHIB): Institute that distributes information on new computing technologies and their impact on the humanities – http://dhibeirut.wordpress.com
- Ganjoor: Machine-readable and open-access corpus of Persian literature – https://ganjoor.net – https://github.com/ganjoor
- Hazine: A guide to researching the Middle East and beyond – http://hazine.info
- Islamicate Digital Humanities Network (IDHN): Network of scholars with a research focus on Islamicate Digital Studies – http://idhn.org
- Mamluk Prosopography Project (MPP): Project on historiography, Political Order and State Formation in 15th Century Egypt and Syria – http://www.mms.ugent.be
- Translatio: Project that digitizes Arabic, Ottoman Turkish and Persian magazines (published between 1860 and 1945) and provides information on other collections and projects – http://www.translatio.uni-bonn.de
- Maydan: Online publication of the Ali Vural Ak Center for Global Islamic Studies at George Mason University offering expert analysis on a wide variety of issues in the field of Islamic Studies for academic and public audiences alike – http://themaydan.com
- MENADoc: Open-access digital documents on the MENA region and Islamic studies – http://menadoc.bibliothek.uni-halle.de
- Mizan Project: Digital initiative dedicated to encouraging informed public discourse and interdisciplinary scholarship on the culture and history of Muslim societies – http://mizanproject.org
- Onomasticon Arabicum (OA): Database of more than 27,000 scholars and celebrities from the first Muslim millennium – http://onomasticon.irht.cnrs.fr
- Persian Digital Humanities: Initiative in Persian Digital Humanities at the University of Maryland – http://persdig.umd.edu
- Persian Digital Library: Open-access corpus of more than 60,000 Persian poems (and a few prose texts) from the raw texts provided by Ganjoor – http://persdigumd.github.io/PDL
- Persian Manuscript Initiative: Website dedicated to the preservation and digital enhancement of Persian manuscripts throughout the world – http://persianmanuscriptinitiative.github.io
- Ptolemaeus Arabus et Latinus (PAL): Online database of Ptolemy's texts in Latin and Arabic – http://ptolemaeus.badw.de/start
- Researching the Colonial Histories of the Malay Archipelagoes: Resource guide on Malaysia, Indonesia, the Philippines, Thailand and Singapore – http://sites.google.com/um.edu.my/colonialhistoryofthemalayworld/home
- SHARIAsource Lab: Primary sources and publications on Islamic law – http://pil.law.harvard.edu/shariasource-portal
- The Digital Orientalist: Online magazine on digital tools for Oriental Studies – http://digitalorientalist.com
- Vezveze-Kandu: Blog about Iranian studies containing digital topics – http://vezveze-kandu.de
- ʿArūż (Persian Prosody): Online tool that performs quantitative analysis of Persian meter – http://prosody.ir
- Vejinbooks: Kurdish Poems Corpus – http://books.vejin.net