Toxic Word Analyzer

Authors(1) :-Dhairya Timbadia

In this generation social media has been a huge part of our lives and there is no need to say that the current generation spend a huge amount of time on their social media accounts. Apart from there being a good social media influencer there are a lot of people who spread hatred among these influencers as well as among each other. I have tried to make a speedometer which would be able to tell the toxicity of the words that are basically used in the input sentence or paragraph. The main processing that would be done on the sentence or the paragraph would be removing punctuation marks, tokenization on the words, ‘Stop’ word removal, bigram creation, matching tokens with predefined dictionary, generating toxicity percent using scaling.

Authors and Affiliations

Dhairya Timbadia
Computer Engineering, Rajiv Gandhi Institute of Technology, Mumbai, Maharashtra, India

Toxicity, Tokenization, Speedometer

  1. Thedora Chu, Max Wang, Kylie Jue. “Comment Abuse Classification with Deep Leraning.” Stanford University.
  2. Karthik Diankar, Roi Riechart, Henry Lieberman. “Modeling the Detection of Textual Cyberbullying.” Massachusetts Institute of Technology, Cambridge MA 02139 USA.
  3. Xin Wang, Yuanchao Li, Chengjie Sun, Baoxum Wang and Xialong Wang. “Polarities of Tweets by Composing Word Embeddings with Long Short Term Memory.” 7th International Joint Conference of Natural Language Processing. July-2005.
  4. S. V. Georgakopoulus, A. G. Vrahatis, S. K. Tasoulis, V. P. Plagianakos. “Convolutional Neural Networks for Toxic Comment Classification.” arXiv:1802.099574v1 cs.CL], 27 Feb 2018.
  5. Kevin Khieu, Neha Narwal. “Detecting and Classifying Toxic Comments.” Stanford University- CSS224N.
  6. C. Nobata, J.Tetreault,A. Thomas, Y. Mehdad and Y.Chang. “Abusive language detection in online user content.”

Publication Details

Published in : Volume 7 | Issue 3 | May-June 2021
Date of Publication : 2021-06-30
License:  This work is licensed under a Creative Commons Attribution 4.0 International License.
Page(s) : 578-581
Manuscript Number : CSEIT2173123
Publisher : Technoscience Academy

ISSN : 2456-3307

Cite This Article :

Dhairya Timbadia, "Toxic Word Analyzer", International Journal of Scientific Research in Computer Science, Engineering and Information Technology (IJSRCSEIT), ISSN : 2456-3307, Volume 7, Issue 3, pp.578-581, May-June-2021. Available at doi : https://doi.org/10.32628/CSEIT2173123
Journal URL : https://res.ijsrcseit.com/CSEIT2173123 Citation Detection and Elimination     |      |          | BibTeX | RIS | CSV

Article Preview