Text classification on Tamil

Omprakash Yadav(1), Alcina Judy(2), Praveen D’souza(3), Calvin Galbaw(4*), Hinal Rane(5),

(1) 
(2) 
(3) 
(4) 
(5) 
(*) Corresponding Author

Abstract


By and large, we don't know to talk and read the territorial dialects that are spoken in our nation. So we have accepted Tamil language as it is our territorial and numerous doesn't get it. In our task, the content in Tamil language is stacked from Wikipedia. It is then sifted through and extraordinary characters are evacuated it is then characterized by the titles like id, title, URL, etc. It is then used to prepare the model utilizing CNN calculation and the dataset is created. Along these lines, you would now be able to test utilizing an irregular Wikipedia page and the content is grouped by the titles and anticipated.

Full Text:

PDF

References


E. Annamalai and S. B. Steever. Modern Tamil in Dravidian languages. Newyork: Routledge Publication, 1999.

R. K. Belew, “Adaptive information retrieval.” In Proceedings of the 12th annual international ACM/SIGIR conference on research and development in information retrieval, NY, 11–20, 1989.

L. Chanunya and R. Peachavanish, “Automatic Thai language essay scoring using neural network and latent semantic analysis.” In Proceedings of the first Asia international conference on modeling and simulation, 2007.

C.H. Li and S.C. Park, “Text categorization based on artificial neural networks.” In ICONIP, 4234, LNCS 302–311, 2006.

C.H. Li and S.C. Park, “Neural network for text classification based on singular value decomposition.” In Seventh international conference on computer and information technology, 47–52, 2007.




DOI: https://doi.org/10.24071/ijasst.v3i2.2826

Refbacks

  • There are currently no refbacks.









Publisher : Faculty of Science and Technology

Society/Institution : Sanata Dharma University

 

 

 

Creative Commons License
This work is licensed under a Creative Commons Attribution 4.0 International License.