Architecture for User's Identification on Social Media Using Writeprint based on Distributed Machine LearningTechniques

  • Hardik Ashar university of pune
  • Abhishek Murkute
Keywords: writeprint, stylometric, multi layered perceptron model, user identification, social networking website, big data, machine learning

Abstract

—From dawn of social networking sites, there has been an instantaneous growth in its popularity. It is because of this reason several issues has emerged. This has allure the cyber criminals. An illicit use of publishing online messages or blogs for illicit intent has become an issue of great concern. This has abundantly assist to a ever precarious threat to social networks. Therefore, user identification is necessary. Biometric information is of two types scilicet biological and behavior. The writeprint falls in behavioral type of biometric information. This information is used for further analysis and detection purpose. This paper is regarding improvement on previously proposed framework designed for detecting user identification for social networking websites using a writeprint approach. In writeprint approach the semantic, stylometric, sentimental properties from written text of the user are captured, which is further used to detect user identity. Initially, the content written on social networking websites is downloaded. Later from the downloaded text, numerous features are extracted using text mining or web mining techniques. Most of these features are extracted natural language processing tool. Machine Learning techniques like Multi layered perceptron mode, Support Vector Machine (SVM), ensemble modeling methods in the distributed environment which is a supervised machine learning technique is used for user identity detection.

References

[1] J. Li, R. Zheng, and H. Chen,"From fingerprint to writeprint.", Communications of the ACM - Supporting exploratory search. 76-82. Available: http://dl.acm.org/citation.cfm?id=1121951. [2] Sara Keretna, Ahmad Hossny, Doug Creighton"Recognising User Identity in Twitter Social Networks via Text Mining", In the journal of Institute of Electrical and Electronics Engineers, 2013. [3] Sara EL MANAR EL BOUANANI, Ismail KASSOU," Using lexicometry and vocabulary analysis techniques to detect a signature for web profile", In the journal of IEEE/ACM International Conference on Advances in Social Networks Analysis and Mining, 2013. [4] Fredrik Johansson, Lisa Kaati, AmendraShrestha," Detecting Multiple Aliases in Social Media", In the journal of Institute of Electrical and Electronics Engineers, 2013. [5] Shlomo Argamon, Moshe Koppel, James W. Pennebaker and Jonation Schler, “Automatically Profiling the author of an Anonymous text”, In communication of ACM , Vol 52,No:3, 2009. [6] Hong Zhu, Zhaoli Zhang, Zhi Liu,"Online Writeprint Identification Via Multi-PRM", In the journal of Institute of Electrical and Electronics Engineers,2012. [7] Farkhund Iqbal, Hamad Binsalleeh, Benjamin C.M. Fung and Mourad Dabbabi, “Mining writeprints from anonymous e-mails for forensic investigation”, In journal of Science Direct,2010. [8] Ahemed Abbsi and Hsinchun Chen,"Writepeint:A Stylometreic Approach to Identitfication and similarity detection in cyberspace ",In the journel of ACM transaction on Information system, Vol 26,No:2, Article 7, 2008. [9] Abhishek M. Murkute, Jayant Gadge, “Framework for User Identification Using Writeprint Approach”, IEEE International Conference on Technologies for Sustainable Development (ICTSD2015), Feb. 04 – 06, 2015, Mumbai, India. [10] Yamamoto, Yusuke. "Twitter4J-A Java Library for the Twitter API." (2010). Available: http://twitter4j.org/en, last visited on 04-11-2014. [11] K.Toutanova Stanford Log-linear Part-Of-Speech tagger Available: http://nlp.stanford.edu/software/tagger.shtml. [12] Barrey L. Kalman and Stan C. Kwasny, “Why tanh: Chooosing a Sigmodial funtion”, In the journal of Institute of Electrical and Electronics Engineers, 1992. [13] Jacek M. Zurada, “Introduction to Artificial Neural System”, West Publishing Company, 1992. [14] Tom M. Mitchell,"Machine Learning",McGraw-Hill Publication,1997. [15] Ahemed Abbsi and Hsinchun Chen,""Sentiment analysis in multi languages: Feature selection for opnion classification", In the journel of ACM transaction on Information system, Vol 26,No:3, Article 12, 2008. [16] Abhishek M. Murkute, Jayant Gadge, “ Identification of Users on Social Networking Website Using Soft Computing Technique ”,( presented in) IEEE International Conference on Technologies for Internet of things (IOTA-2016), Jan. 22 – 24, 2015, Pune, India. [17] Kiranjot Kaur , Anjandeep Kaur Rai." A Comparative Analysis: Grid, Cluster and Cloud Computing ", International Journal of Advanced Research in Computer and Communication Engineering Vol. 3, Issue 3, March 2014 [18] Zaharia, Matei, et al. "Resilient distributed datasets: A fault-tolerant abstraction for in-memory cluster computing." Proceedings of the 9th
USENIX conference on Networked Systems Design and Implementation. USENIX Association, 2012. [19] Abadi, Martın, et al. "Tensorflow: Large-scale machine learning on heterogeneous distributed systems." arXiv preprint arXiv:1603.04467 (2016).
Published
2018-03-23
How to Cite
Ashar, H., & Murkute, A. (2018). Architecture for User’s Identification on Social Media Using Writeprint based on Distributed Machine LearningTechniques. Asian Journal For Convergence In Technology (AJCT) ISSN -2350-1146, 3(3). Retrieved from http://asianssr.org/index.php/ajct/article/view/237
Section
Article

Most read articles by the same author(s)

Obs.: This plugin requires at least one statistics/report plugin to be enabled. If your statistics plugins provide more than one metric then please also select a main metric on the admin's site settings page and/or on the journal manager's settings pages.