Paper title:

Mining Social Media and DBpedia Data Using Gephi and R

DOI: https://doi.org/10.4316/JACSM.201801002
Published in: Issue 1, (Vol. 12) / 2018
Publishing date: 2018-04-19
Pages: 14-20
Author(s): HUSSAIN Sadiq, MUHAMMAD L. J., YAKUBU Atomsa
Abstract. The big data is playing a big role in the field of machine learning and data mining. To extract meaningful and interesting information from big data mining is a challenge. The size of the data at social media and Wikipedia are increasing exponentially. To visualize such huge data is another aspect of big data. The roles of graphs are becoming important in case of visualization and modelling of such data. Gephi and R are two important visualization and exploration tools in this field. Using graph, one may find and calculate modularity, eccentricity, Indegree, Outdegree, betweenness centrality etc. In this paper, we had used Dbpedia, facebook and twitter datasets. We had used Gephi and R to look inside the structure of such data and comparing different statistics based on the graph by exploring the graphs.
Keywords: Big Data, DBpedia, Gephi, R, Graph
References:

1. Alistair, W., Ali F. and Ilia L. (2015). Mapping networks of influence: Tracking Twitter conversations through time and space, Journal of Audience and Reception Studies 12(1).

2. Ángel, H. (2014). Using Gephi to visualize online course participation: a Social Learning Analytics approach, Italian Journal of Education Technology, 22(3)

3. Basics of Graph theory. Retrieved from www.cse.iitkgp.ac.in/~animeshm/FirstHalfScribe.pdf Accessed date 3rd December 2017

4. Bandgar, B. M., Karande, D. N. and Binod K. (2014).An Analysis of Social Network Data, IFRSA International Journal of Data Warehousing & Mining |Vol 4|issue3|August 2014

5. Bastian M., Heymann S., Jacomy M. (2009). Gephi: an open source software for exploring and manipulating networks. International AAAI Conference on Weblogs and Social Media.

6. Bernhard Rieder (2013). “Studying Facebook via data extraction: the Netvizz application”, Proceedings of the 5th Annual ACM Web Science Conference, pp 346-355.

7. Bizer, Christian; Lehmann, Jens; Kobilarov, Georgi; Auer, Soren; Becker, Christian; Cyganiak, Richard; Hellmann, Sebastian (September 2009). "DBpedia - A crystallization point for the Web of Data" (PDF). Web Semantics: Science, Services and Agents on the World Wide Web. 7 (3): 154–165. doi:10.1016/j.websem. 2009.07.002. ISSN 1570-8268.

8. Dedić, N.; Stanier, C. (2017). "Towards Differentiating Business Intelligence, Big Data, Data Analytics and Knowledge Discovery". 285. Berlin; Heidelberg: Springer International Publishing. ISSN 1865-1356. OCLC 909580101.

9. Fox, J. & Andersen, R. ( 2005). Using the R Statistical Computing Environment to Teach Social Statistics Courses, Department of Sociology, McMaster University.

10. Georgios, A. P., Maria, S., Charalampos, N. M., Theodoros, G. S., Sophia, K., Jan Aerts, R. S. and Pantelis, G. B.. (2001).Using graph theory to analyze biological networks, BioData Mining 4(10).

11. Graham, A. L., Zhao, K., Papandonatos, G. D., Erar, B., Wang, X., Amato, M. S., et al. (2017.) A prospective examination of online social network dynamics and smoking cessation. PLoS ONE 12(8): e0183655. https://doi.org/10.1371/journal.pone.0183655

12. Hebeler, John; Fisher, Matthew; Blace, Ryan; Perez-Lopez, Andrew (2009). “Semantic Web Programming”. Indianapolis, Indiana: John Wiley & Sons. p. 406. ISBN 978-0-470-41801-7.

13. Jacomy M, Venturini T, Heymann S, Bastian M (2014) ForceAtlas2, a Continuous Graph Layout Algorithm for Handy Network Visualization Designed for the Gephi Software. PLoS ONE 9(6): e98679. doi:10.1371/journal.pone.0098679.

14. Jeffrey S. R. (2012). R Studio: A Platform-Independent IDE for R and Sweave, Journal of Applied Econometrics, 27: 167–172 (2012)

15. Kancharla, S. and, Sudhakar, N. R. (2017). Sentiment Change Detection in Twitter Data Using R Studio, International Journal for Research in Applied Science & Engineering Technology (IJRASET), 5(5).

16. Kefi, H., Indra, S. and Abdessalem, T. (2016). Social Media Marketing Analytics: A Multicultural Approach Applied To The Beauty & Cosmetics Sector" (2016). PACIS 2016 Proceedings, 176.

17. Lehmann, J., Isele, R., Jakob, M., Jentzsch, A., Kontokostas, D., Mendes, P. N., Hellmann, S., Morsey, M. Van Kleef, P., Auer, S. and Bizer, C. (2015). DBpedia - a large-scale, multilingual knowledgebase extracted from Wikipedia. Semantic Web Journal, 6(2):167

18. Mathieu, B. and Sebastien, H. and Mathieu J. (2009). Gephi: An Open Source Software for Exploring and Manipulating Networks, Proceedings of the Third International ICWSM Conference, 361-362 (2009)

19. Matt, H., (2011). Databases and the Web. Retrieved from www.tinman.cs.gsu.edu/~raj/8711/sp11/presentations/DBPedia.pdf . Accessed date 3rd December 2017

20. Muhammad Lawan Jibril, Ibrahim Ali Mohammed and Atomsa Yakubu (2017). “Social Media Analytics Driven Counterterrorism Tool to improve Intelligence Gathering towards Combating Terrorism in Nigeria”, International Journal of Advanced Science and Technology Vol.107, pp.33-42.

21. Mohamed Morsey, Jens Lehmann, Sören Auer, Claus Stadler, Sebastian Hellmann, (2012) "DBpedia and the live extraction of structured data from Wikipedia", Program: electronic library and information systems, Vol. 46 Iss: 2, pp.157 – 18.

22. Morsey, J., Lehmann, M., Auer, S., Stadler, C. and Hellmann, S. (2012). DBpedia and the live extraction of structured data from Wikipedia. The program, 46(2):15.

23. Newman, M. E. J. (2006). "Modularity and community structure in networks". Proceedings of the National Academy of Sciences of the United States of America. 103 (23): 8577–8696. arXiv: physics/0602124. Bibcode:2006 PNAS. 103.8577N. doi:10.1073/pnas.0601602103. PMC 1482622 PMID 16723398.

24. Sanchita, P. (2016).WhatsApp Group Data Analysis with R, International Journal of Computer Applications, 154 (4).

25. Sonal Singh and Shyam S Choudhary, (2017). Social Media Data Analysis: Twitter Sentimental Analysis using R Language, Proceedings of IEEEFORUM International Conference, 01st October 2017, Pune, India

26. Soren, A., Christian, B., Georgi, K., Jens, L., Richard, C., and Zachary, I. (2007). DBpedia: A Nucleus for a Web of Open Data, Proceedings of the 6th international The semantic web and 2nd Asian conference on Asian semantic web conference, ISWC'07/ASWC'07, pg 722-735

27. Tippmann, Sylvia (29 December 2014). "Programming tools: Adventures with R". Nature. 517: 109–110. doi:10.1038/517109a

28. West, D. B. (2000).Introduction to Graph Theory, 2nd ed. Englewood Cliffs, NJ: Prentice-Hall.

Back to the journal content
Creative Commons License
This article is licensed under a
Creative Commons Attribution-ShareAlike 4.0 International License.
Home | Editorial Board | Author info | Archive | Contact
Copyright JACSM 2007-2024