Murthy's Ideas: NLP for Sanskrit and all Indian Languages


  • Natural Language Processing (NLP) of Sanskrit (samskritam) and samskritam-based Indian languages. Develop any additional NLP techniques that are needed. English-based NLP could be used as an example for developing native NLP tools, techniques, and methodology.

  • bhaaratheeya samskrita, sakala bhaasha gaNaka yantra samskaraNa = bhaarateeya bhaasha gaNaka samskaraNa
  • NLP-based ingestion of dictionaries and thesauri in samskritam and all Indian languages

  •  Develop tatsama ---> tadbhava relationships in the NLP-based knowledge corpus. Develop a shared vocabulary corpus across the bhaarateeya bhaashas.
  •  Once the vocabulary corpus is developed, Indian literature (mantras, sootras, shaastras, samhitas and saahitya) is processed.
  • It is imperative to develop NLP not only for the script (lipi) but, more importantly, for spoken words and language, further supplemented by a literary corpus that includes all genres of literature and all languages.
  • Languages like Hindi and Urdu, and other localized language versions and dialects, need special attention, as some of the dialects have many words derived not from samskritam but from Middle Eastern roots, as in Urdu.

  •  In particular, the so-called Hindi language contains a significant number of Urdu words, and it is the youngest among languages that are otherwise very mature, with over 2000 years of history. Hindi and Urdu pose severe impediments to NLP creation and advancement for samskritam-based languages.
  • The NLP team should be made up of language scholars, linguistic scholars, scholars in etymology, computer scientists, NLP experts, engineers, and scholars in vedas and shastras in samskritam and the different bhaarateeya languages.

  •  The NLP expertise could be developed through a layered approach, one step at a time. 
  •  The work would involve research laboratories across India, universities, independent language institutes, and teams of language scholars from across the nation.
  • Further, as part of the goals, big data, AI, ML and NN techniques applicable to the Indian language corpus will be developed and advanced.
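
As a rough illustration of the tatsama ---> tadbhava idea above, here is a minimal Python sketch of how such relationships might be stored and queried across languages. Everything in it is an assumption made for illustration: the class and method names are hypothetical, the words are romanized by hand, and the three sample entries are only a tiny seed, not a real corpus.

```python
from collections import defaultdict
from dataclasses import dataclass


@dataclass(frozen=True)
class Derivation:
    """One tatsama -> tadbhava link (all forms romanized; illustrative only)."""
    tatsama: str   # Sanskrit source form
    language: str  # language in which the derived form is used
    tadbhava: str  # derived form in that language


class TatsamaTadbhavaIndex:
    """Hypothetical in-memory index; a real corpus would need persistent storage."""

    def __init__(self):
        self._by_tatsama = defaultdict(list)
        self._by_tadbhava = {}

    def add(self, tatsama, language, tadbhava):
        link = Derivation(tatsama, language, tadbhava)
        self._by_tatsama[tatsama].append(link)
        self._by_tadbhava[(language, tadbhava)] = link

    def tadbhavas(self, tatsama):
        """Map language -> derived form for a Sanskrit source word.

        Simplification: only one derived form per language is kept.
        """
        return {d.language: d.tadbhava for d in self._by_tatsama.get(tatsama, [])}

    def source_of(self, language, word):
        """Return the Sanskrit source of a derived word, or None if unknown."""
        link = self._by_tadbhava.get((language, word))
        return link.tatsama if link else None

    def cognates(self, language, word):
        """Forms in other languages that share the same Sanskrit source."""
        source = self.source_of(language, word)
        if source is None:
            return {}
        return {
            lang: form
            for lang, form in self.tadbhavas(source).items()
            if lang != language
        }


# Small hand-entered sample, for illustration only.
index = TatsamaTadbhavaIndex()
index.add("hasta", "hindi", "haath")
index.add("hasta", "bengali", "haat")
index.add("agni", "hindi", "aag")

print(index.cognates("hindi", "haath"))
```

Keying the reverse lookup on the (language, word) pair is one possible design choice: it lets the same spelling appear in two languages without collision, which matters once the vocabulary corpus spans many bhaashas.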

  • Feel free to contact me at Dr.Sri.KRS.Murthy@gmail.com and (408)-464-3333 
