WordNet in Indian Languages

WordNet in Indian Languages (eBook)

eBook Download: PDF
2016 | 1st ed. 2017
XVII, 264 pages
Springer Singapore (publisher)
978-981-10-1909-8 (ISBN)
96,29 incl. VAT

This contributed volume discusses in detail the process of construction of a WordNet of 18 Indian languages, called 'Indradhanush' (rainbow) in Hindi. It delves into the major challenges involved in developing a WordNet in a multilingual country like India, where the information spread across the languages needs utmost care in processing, synchronization and representation. The project has emerged from the need of millions of people to have access to relevant content in their native languages, and it provides a common interface for information sharing and reuse across the Indian languages. 

The chapters discuss important methods and strategies of language computation, language data processing, lexical selection and management, and language-specific synset collection and representation, which are of utmost value for the development of a WordNet in any language. The volume overall gives a clear picture of how WordNet is developed in Indian languages and how this can be utilized in similar projects for other languages. It includes illustrations, tables, flowcharts, and diagrams for easy comprehension.  

This volume is of interest to researchers working in the areas of language processing, machine translation, word sense disambiguation, culture studies, language corpus generation, language teaching, dictionary compilation, lexicographic queries, cross-lingual knowledge sharing, e-governance, and many other areas of linguistics and language technology.



Niladri Sekhar Dash, Ph.D., is Associate Professor, Linguistic Research Unit, Indian Statistical Institute, Kolkata. He is also Editor-in-Chief, Journal of Advanced Linguistic Studies, and principal investigator of the Indian Languages Corpora Initiative-Bengali and the Digital Bangla Pronunciation Dictionary. His main areas of research are corpus linguistics, natural language processing, computational lexicography, machine translation, WordNet design and development, lexical semantics, computer-assisted language teaching, digital language resource development, and language documentation and digitization.

Pushpak Bhattacharyya, Ph.D., is Director, Indian Institute of Technology Patna. Previously, he was Vijay and Sita Vashee Chair Professor, Department of Computer Science and Engineering, Indian Institute of Technology Bombay, Mumbai; consortium leader of the Indradhanush WordNet in Indian languages; Associate Editor, ACM Transactions on Asian Language Information Processing; and leader of multi-institute consortia projects on Indian language WordNets, Indian language search engine, and machine translation. Professor Bhattacharyya has been a visiting professor at Stanford University (2004) and the University of Grenoble (2005, 2009 and 2011), and a distinguished lecturer at the University of Houston (2012). His research areas are natural language processing, machine learning, cross-lingual IR, information extraction, and WordNet design and development.

Jyoti D. Pawar is Associate Professor, Department of Computer Science and Technology, Goa University, Goa. She is co-consortium leader of the Indradhanush WordNet in Indian languages. Her research areas are natural language processing (NLP), data mining, and data structures.



Chapter 1. IndoWordNet (Pushpak Bhattacharyya)
Chapter 2. Insights on Hindi WordNet Coming from the IndoWordNet (Laxmi Kashyap, Salil Rajeev Joshi and Pushpak Bhattacharyya)
Chapter 3. Defining Language Specific Synset (LSS) in IndoWordNet: Some Theoretical and Practical Issues (Niladri Sekhar Dash)
Chapter 4. Problems in Translating Hindi Synsets into the Bangla WordNet (Niladri Sekhar Dash)
Chapter 5. Development of Punjabi WordNet, Bilingual Dictionaries, Lexical Relations Creation and its Challenges (R.K. Sharma and Parteek Kumar)
Chapter 6. Insights on the Konkani WordNet Development Process (Shilpa N. Desai, Shantaram W. Walawalikar, Ramdas N. Karmali, and Jyoti D. Pawar)
Chapter 7. Malayalam WordNet (S. Rajendran and K. P. Soman)
Chapter 8. Creating Marathi WordNet (Lata Popale and Pushpak Bhattacharyya)
Chapter 9. Gujarati WordNet: A Profile of the IndoWordNet (Brijesh S. Bhatt, C. K. Bhensdadia, Pushpak Bhattacharyya, Dinesh Chauhan and Kirit Patel)
Chapter 10. Issues in the Creation of Synsets in Odia WordNet: A Report (Panchanan Mohanty, Ramesh C. Malik and Bhimasena Bhol)
Chapter 11. Building Telugu WordNet using Expansion Approach (S. Arulmozi and M.C. Kesava Murty)
Chapter 12. Challenges, Problems and Issues Faced in Language Specific Synset Creation and Linkage in the Kashmiri WordNet (Aadil Amin Kak, Farooq Ahmad, Nazima Mehdi, Mansoor Farooq, and Muneera Hakim)
Chapter 13. Language Specific Synsets and Challenges in Synset Linkage in Urdu WordNet (Rizwanur Rahman, Mazhar Mehdi Hussain and Niladri Sekhar Dash)
Chapter 14. Sanskrit Wordnet at IITB (Malhar Kulkarni)
Chapter 15. Word Sense Disambiguation Using IndoWordNet (Sudha Bhingardive and Pushpak Bhattacharyya)
Appendix I: The Team of the IndoWordNet

Publication date (per publisher): 20.10.2016
Additional info: XVII, 264 p. 76 illus., 59 illus. in color.
Place of publication: Singapore
Language: English
Subject areas: Non-fiction / Guides
School books / Dictionaries > Dictionaries / Foreign languages
Humanities > Language and literature studies > Linguistics
Computer science > Theory / Studies > Artificial intelligence / Robotics
Social sciences > Education
Keywords: homonymy • Hypernymy • hyponymy • Meronymy • Polysemy • synonymy • Synset • WordNet
ISBN-10 981-10-1909-6 / 9811019096
ISBN-13 978-981-10-1909-8 / 9789811019098
PDF (watermark)
Size: 11.1 MB

DRM: Digital watermark
This eBook contains a digital watermark and is thereby personalized for you. If the eBook is improperly passed on to third parties, it can be traced back to the source.

File format: PDF (Portable Document Format)
With its fixed page layout, PDF is particularly suitable for specialist books with columns, tables and figures. A PDF can be displayed on almost all devices, but is only of limited suitability for small displays (smartphone, eReader).

System requirements:
PC/Mac: You can read this eBook on a PC or Mac. You need a PDF viewer, e.g. Adobe Reader or Adobe Digital Editions.
eReader: This eBook can be read with (almost) all eBook readers. However, it is not compatible with the Amazon Kindle.
Smartphone/Tablet: Whether Apple or Android, you can read this eBook. You need a PDF viewer, e.g. the free Adobe Digital Editions app.

Buying eBooks from abroad
For tax law reasons, we can sell eBooks only within Germany and Switzerland. Regrettably, we cannot fulfill eBook orders from other countries.
