Manjira Sinha

Research Scientist


Manjira Sinha is a research scientist in the Text and Graph Analytics (TGA) group of Conduent Labs India (previously known as Xerox Research Center India), which she joined in May 2015. Manjira completed her Ph.D. in Text Readability at the Indian Institute of Technology Kharagpur. Her research interests are Natural Language Processing, Computational Linguistics, Machine Learning and Human Computer Interaction. She has been associated with a number of projects related to Cross Domain Text Categorization, Social Media Analytics, Urban Informatics, Health-care Text Analytics and NLP for Real-time Consumer Chats. At present, she is leading a project on context-driven urban issue resolution with applications to city governance and smart cities.

Prior to joining Xerox in 2015, Manjira completed her PhD in the Department of Computer Science and Engineering, IIT Kharagpur. Her research interests lie in language comprehension and natural language processing, educational technology, human computer interaction and assistive technology. In her doctoral research, she studied how native readers perceive text difficulty in Bangla and developed computational models for text readability. Apart from her doctoral work, she has been associated with the development of various assistive technology systems in the Communication Empowerment Laboratory, IIT Kharagpur.

Apart from technical areas, she is also interested in critical thinking and socio-political issues.

Contact:

• Email: Manjira [dot] sinha [at] conduent [dot] com


  • Journals

  • 1. Manjira Sinha and Anupam Basu, A Study of Readability of Texts in Bangla through Machine Learning Approaches, Journal of Education and Information Technologies (EAIT), Springer, vol. 21, pp. 1071-1094, 2016.

  • 2. Tirthankar Dasgupta, Manjira Sinha, and Anupam Basu, Computational Models of the Lexical Representation of Bangla Compound Words in the Mental Lexicon, Journal of Psycholinguistic Research (JOPR), Springer, vol. 45, pp. 833-855, 2016.

  • 3. Tirthankar Dasgupta, Manjira Sinha, and Anupam Basu, Computational Modelling of Morphological Effects in Bangla Visual Word Recognition, Journal of Psycholinguistic Research (JOPR), Springer, vol. 44, pp. 587-610, 2015.

  • 4. Anupam Basu, Tirthankar Dasgupta, and Manjira Sinha, Development of a Framework for Information Processing, and Communication for Blind People in India, Journal of Technology and Disability (T&D), IOS Press, vol. 27, no. 3, pp. 65-78, 2015.

  • 5. Tirthankar Dasgupta, Manjira Sinha, and Anupam Basu, Resource Creation and Development of an English-Bangla Back Transliteration System, International Journal of Knowledge-Based and Intelligent Engineering Systems (KES), IOS Press, vol. 19, no. 1, pp. 35-46, 2015.

  • Conferences

  • 1. Anirban Sen, Manjira Sinha, and Sandya Mannarswamy, Multi-task Representation Learning for Enhanced Emotion Categorization in Short Text, Pacific-Asia Conference on Knowledge Discovery and Data Mining (PAKDD), 2017 (to appear)

  • 2. Shreshtha Mundra, Anirban Sen, Manjira Sinha, Sandya Mannarswamy, Sandipan Dandapat and Shourya Roy, Fine-grained Emotion Detection in Contact Center Chat Utterances, Pacific-Asia Conference on Knowledge Discovery and Data Mining (PAKDD), 2017 (to appear)

  • 3. Himanshu S. Bhatt, Manjira Sinha and Shourya Roy, Cross-domain Text Classification with Multiple Domains and Disparate Label Sets, 54th Annual Meeting of the Association for Computational Linguistics (ACL), 2016, pp. 1641-1650.

  • 4. Manjira Sinha, Preethy Varma, Gayatri Shivakumar, Mridula Singh, Deepthi Chander, Tridib Mukherjee, Koustuv Dasgupta, Improving Urban Transportation through Social Media Analytics, 3rd ACM IKDD Conference on Data Sciences (CoDS), 2016 (Poster)

  • 5. Tirthankar Dasgupta, Manjira Sinha, and Anupam Basu, Compositionality in Bangla Compound Verbs and their Processing in the Mental Lexicon, 12th International Conference on Natural Language Processing (ICON), 2015, pp. 60-65

  • 6. Manjira Sinha, Tirthankar Dasgupta, Anupam Basu, Text Readability in Hindi: A Comparative Study of Feature Performances Using Support Vector Machine, 11th International Conference on Natural Language Processing (ICON), 2014, pp. 92-100.

  • 7. Tirthankar Dasgupta, Manjira Sinha, Anupam Basu, Web Browsing Interface for People with Severe Speech and Motor Impairment in India, 16th International ACM SIGACCESS Conference on Computers and Accessibility (ASSETS), 2014, pp. 285-286.

  • 8. Manjira Sinha, Tirthankar Dasgupta, Anupam Basu, Development of Accessible Toolset to Enhance Social Interaction Opportunities for People with Cerebral Palsy in India, 16th International ACM SIGACCESS Conference on Computers and Accessibility (ASSETS), 2014, pp. 249-250.

  • 9. Tirthankar Dasgupta, Manjira Sinha, Anupam Basu, Web Sanyog: a Portable Assistive Web Browser for People with Cerebral Palsy, 16th International ACM Conference on Multimodal Interaction (ICMI), 2014.

  • 10. Manjira Sinha, Tirthankar Dasgupta, and Anupam Basu, Influence of Target Reader Background and Text Features on Text Readability in Bangla: A Computational Approach, 25th International Conference on Computational Linguistics (COLING), 2014, pp. 345-354.

  • 11. Manjira Sinha, Tirthankar Dasgupta, and Anupam Basu, Design and Development of an On-line Computational Framework to Facilitate Language Comprehension Research on Indian Languages, 9th International Conference on Language Resources and Evaluation, LREC'14, pp. 203-210.

  • 12. Tirthankar Dasgupta, Manjira Sinha, Anupam Basu, Computational Models to understand the Access and Representation of Bangla Polymorphemic Words in the Mental Lexicon, 24th International Conference on Computational Linguistics (COLING), 2012, pp. 235-244.

  • 13. Manjira Sinha, Shakshi Sharma, Tirthankar Dasgupta, Anupam Basu, A New Readability Measure of Bangla and Hindi Texts, 24th International Conference on Computational Linguistics (COLING), 2012, pp. 1141-1150.

  • 14. Tirthankar Dasgupta, Aakash Anuj, Manjira Sinha, Ritwika Ghose, Anupam Basu, VoiceMail Architecture in Desktop and Mobile Devices for the Blind People, 4th IEEE International Conference on Intelligent Human Computer Interaction (IHCI), 2012.


  • Patents

  • • “Transferring Partial Knowledge for Cross-domain Text Classification across Multiple Domains with Disparate Labelsets.” Patent Application Filed, 2016.

  • • “System and Method to Automatically Identify Urban Issues from Web Sources with Minimal Supervision.” Patent Application Filed, 2017.

  • • “Stance Classification of Multi-Perspective Consumer Health Information.” Patent Application Filed, 2017.

  • • “Dynamic Adaptation of Pre-trained Neural Networks for Cross-domain Learning.” Patent Application Filed, 2017.