Manuscript Number : CSEIT2173123
Toxic Word Analyzer
Authors(1) :-Dhairya Timbadia In this generation social media has been a huge part of our lives and there is no need to say that the current generation spend a huge amount of time on their social media accounts. Apart from there being a good social media influencer there are a lot of people who spread hatred among these influencers as well as among each other. I have tried to make a speedometer which would be able to tell the toxicity of the words that are basically used in the input sentence or paragraph. The main processing that would be done on the sentence or the paragraph would be removing punctuation marks, tokenization on the words, ‘Stop’ word removal, bigram creation, matching tokens with predefined dictionary, generating toxicity percent using scaling.
Dhairya Timbadia Toxicity, Tokenization, Speedometer Publication Details Published in : Volume 7 | Issue 3 | May-June 2021 Article Preview
Computer Engineering, Rajiv Gandhi Institute of Technology, Mumbai, Maharashtra, India
Date of Publication : 2021-06-30
License: This work is licensed under a Creative Commons Attribution 4.0 International License.
Page(s) : 578-581
Manuscript Number : CSEIT2173123
Publisher : Technoscience Academy
Journal URL : https://res.ijsrcseit.com/CSEIT2173123
Citation Detection and Elimination |
|
|
BibTeX | RIS | CSV