Optimizing RDF Clusters using ACO

Abstract

Resource Description framework (RDF) in semantics web faces several challenges in terms of rapid increase in its volume and continuous change. This paper presents a new clustering methodology for semantic web data by utilizing ant colony optimization algorithm. The methodology has two pre-processing steps to extract RDF instances and compute a distance matrix between these instances. Next, ACO is implemented to find clusters based on ants discovering the shortest path. The algorithm also uses two objective functions, compactness and separation, to evaluate the discovered clusters. The experiments are conducted on the proposed methodology and showed promised results for clustering quality.

Country : Yemen

1 Rasha A. Bin-Thalab2 Seham A. Bamatraf

  1. Assist. Prof., Department of Computer Engineering, College of Engineering & Petroleum, Hadhramout University, Mukalla, Yemen
  2. Assist. Prof., Department of Computer Engineering, College of Engineering & Petroleum, Hadhramout University, Mukalla, Yemen

IRJIET, Volume 4, Issue 2, February 2020 pp. 58-63

References

  1. T. Berners-Lee, J. Hendler, and O. Lassila, "The Semantic Web," Scientific American, vol. 5, pp. 34-43, 2001.
  2. RDF, Working, and Group, "Resource Description Framework (RDF)." vol. 2019: W3C Semantic web, 2014.
  3. A.Pugliese, O. Udrea, and V. S. Subrahmanian, "Scaling RDF with Time," in Proceedings of the 17th international conference on World Wide Web Beijing, China: ACM, 2008, pp. 605-614.
  4. V. Castellana, J. Weaver, A. Morari, A. Tumeo, D. Haglin, J. Feo, and O. Villa, "Scaling RDF Triple Stores in Size and Performance. Modeling SPARQL Queries as Graph Homomorphism Routines," Handbook of Statistics, pp. 339-362, 2015.
  5. J. Hjelm, Creating the Semantic Web with RDF: Professional Developer's Guide vol. 1: Wiley, 2001.
  6. T. Berners-Lee, R. Fielding, and L. Masinter, "Uniform Resource Identifier (URI): Generic Syntax," RFC, vol. 3986, pp. 1-61, 2002.
  7. D. Dosso, "Keyword Search on RDF Datasets," in ECIR 2019: Advances in Information Retrieval, Cham, 2019, pp. 332-336.
  8. G. Aluç, M. T. Özsu, and K. Daudjee, "Building self-clustering RDF databases using Tunable-LSH," The VLDB Journal, vol. 28, pp. 173–195, 2019.
  9. S. Bamatraf and R. Bin-Thalab, "Clustering RDF data using K-medoids," in International Conference of Intelligent Computing and Engineering, Mukalla, 2019, p. 8.
  10. S. Eddamiri, E. M. Zemmouri, and A. Benghabrit, "An improved RDF data Clustering Algorithm," in Second International Conference on Intelligent Computing in Data Sciences (ICDS 2018) Fez-Morocco: Elsevier B. V., 2018.
  11. S. Giannini, "RDF Data Clustering," in BIS 2013 Workshop, LNBIP 160, 2013, pp. 220 - 231.
  12. G. A. Grimnes, P. Edwards, and A. Preece, "Instance Based Clustering of Semantic Web Resources," in ESWC 2008: The Semantic Web: Research and Applications Berlin, Heidelberg, 2008, pp. 303-317.
  13. S. Koske, "Swarm Approaches For Semantic Triple Clustering And Retrieval In Distributed RDF Spaces," in FACHBEREICH MATHEMATIK UND INFORMATIK SERIE B • INFORMATIK. vol. M.Sc. Berlin: Freie Universitate Berlin, 2009, p. 146.
  14. J. Yang and J. Yang, "Intelligence Optimization Algorithms: A Survey," Journal of Advancements in Computing Technology, vol. 3, pp. 144-152, 2011.
  15. G. Li and K. Xia, "An Improved Data Mining Technique Combined Apriori Algorithm with Ant Colony Algorithm and its Application," International Journal of Digital Content Technology and its Applications, vol. 5, pp. 241-249, 2011.
  16. G. Zhe, L. Dan, A. Baoyu, O. Yangxi, C. Wei, N. Xinxin, and X. Yang, "An Analysis of Ant Colony Clustering Methods: Models, Algorithms and Applications," International Journal of Advancements in Computing Technology (IJACT), vol. 3, 2011.
  17. L. Kaufman and P. J. Rousseeuw, "Clustering by means of Medoids, in Statistical Data Analysis Based on the L1 - Norm and Related Methods," in Y. Dodge North- Holland, 1987, pp. 405–416.
  18. lin, nkaya, K. Sinan, gil, E. Nur, and zdemirel, "Ant Colony Optimization based clustering methodology," Appl. Soft Comput., vol. 28, pp. 301-311, 2015.
  19. T. Runkler, "Ant colony optimization of clustering models," International Journal of Intelligent Systems, vol. 2, pp. 1233-1261, 2005.
  20. J. L. Deneubourg, S. Goss, N. Franks, A. Sendova-Franks, C. Detrain, L. Chr, and tien, "The dynamics of collective sorting robot-like ants and ant-like robots," in Proceedings of the first international conference on simulation of adaptive behavior on From animals to animats Paris, France: MIT Press, 1990.
  21. C.-F. Tsai, C.-W. Tsai, and H.-C. Wu, "ACODF: a novel data clustering approach for data mining in large databases," Journal of System Software, vol. 73, pp. 133-145, 2004.
  22. D. Graff, "Implementation and Evaluation of a SwarmLinda System," Masterarbeit. FU Berli, Berlin 2008.
  23. M. Harasic, A. Augustin, P. Obermeier, and R. Tolksdorf, "RDFSwarms: selforganized distributed RDF triple store," in Proceedings of the 2010 ACM Symposium on Applied Computin Sierre, Switzerland: ACM, 2010, pp. 1339-1340.
  24. M. Dorigo, "Optimization, Learning and Natural Algorithms." vol. PhD thesis: Politecnico di Milano, 1992.
  25. M. Dorigo, V. Maniezzo, and A. Colorni, "Ant system: optimization by a colony of cooperating agents," IEEE Transactions on Systems, Man, and Cybernetics, Part B (Cybernetics), vol. 26, pp. 29-41, 1996.
  26. M. Dorigo and L. M. Gambardella, "Ant colony system: a cooperative learning approach to the traveling salesman problem." IEEE Trans. Evolutionary Computation, vol. 1, pp. 53-66, 1997.
  27. S. B.Needleman and C. D.Wunsch, "A general method applicable to the search for similarities in the amino acid sequence of two proteins," Journal of Molecular Biology (jmb), vol. 48, pp. 443-453, 1970.
  28. T. Pedersen, S. Patwardhan, and J. A Michelizzi, "WordNet::Similarity - Measuring the Relatedness of Concepts," in Association for Computational Linguistics, Boston, Massachusetts, USA, 2004, pp. 38-41.
  29. C. Knox, V. Law, T. Jewison, P. Liu, S. Ly, A. Frolkis, A. Pon, K. Banco, C. Mak, V. Neveu, Y. Djoumbou, R. Eisner, A. Guo, and D. Wishart, "DrugBank 3.0: a comprehensive resource for 'omics' research on drugs.." vol. 2019: Nucleic Acids Res, 2011.