Study on the Use of Data Island Method for Creating Metadata Records with Indexability and Visibility of Element Tag Names in the Web Search Engines Environment: the Improvement of Access to Content Objects

Document Type : مقالات پژوهشی

Authors

university of azad

Abstract

Purpose: The aim of this research was to examine the use of data island method for creating metadata records based on DCXML, MARCXML, and MARC21with indexability and visibility of element tag names in the Web search engines environment.
Methodology: 600 metadata records were developed in two groups (300 XML-based records as the control group with the normal structure, and 300 HTML-based records in an experimental group with special structure embedded in the tag of HTML based on the data island strategy). These records were analyzed through an experimental approach. The records of these two groups were published on two independent websites and were introduced to Google and Yahoo! search engines for indexing.
Findings: Findings show that all the element tag names of the metadata records created based on the data island method relating to the experimental group indexed by Google and Yahoo! were visible in the search results. But the element tag names in the control group’s metadata records were not indexed by the search engines. Based on this study, it is possible to retrieve the metadata records by their element tag name in search engines. But the records of the control group are accessible by the element values only. The research suggests some patterns to metadata creators and end users.

Keywords


Alexa: The Web Information Company (2011). Global Top 500. Retrieved 14 Dec. 2011 from: http://www.alexa.com/site/ds/top_sites?ts_mode=global&lang=none
American Society for Indexing (2012).Indexing the Web.Retrieved 7 May 2011 from: http://www.asindexing.org/i4a/pages/index.cfm?pageid=3418
Aqa Abedi, E. (2012), “The effect of syntax on the indexing & ranking of metadata records by the web search engine: a comparative study on MARCXML and DCXML metadata records”, unpublished Master’s thesis, Science and Research Branch, Islamic Azad University, Tehran (in Persian).
Asadi, S.aied; Jamali, H.amid R. (2004). Shifts in Search Engine Development: A Review of Past, Present and Future Trends in Research on Search Engines. Retrieved 7 May 2011 from: http://www.webology.ir/2004/v1n2/a6.html
Bifet, A.lbert; Castillo, C.arlos (2005).An Analysis of Factors Used in Search Engine Ranking. Retrieved 7 May 2011 from: http://airweb.cse.lehigh.edu/2005/bifet.pdf
Bray, T.im; Paoli, J.ean; Sperberg-McQueen, C. M.; Maler, E.ve; Yergeau, F.rançois (2008). Extensible Markup Language (XML) 1.0 (Fifth Edition): W3C Recommendation 26 November 2008. Retrievd 7 Feb. 2012 from: http://www.w3.org/TR/xml/
Buzzi, M.arina; Andronico, P.atrizia; Leporini, B.arbara (2005).Accessibility and Usability of Search Engine Interfaces: Preliminary Testing. Retrieved 7 Feb. 2012 from: http://www.ui4all.gr/workshop2004/files/ui4all_proceedings/adjunct/accessibility/58.pdf
Campex (2012).Top Search Engines. Retrieved 7 Feb. 2012 from: http://capmex.biz/resources/top-search-engines
ComScore (2012).comScore Releases May 2012 U.S. Search Engine Rankings. Retrieved 7 Feb. 2012 from: http://www.comscore.com/Press_Events/Press_Releases/2012/6/comScore_Releases_May_2012_U.S._Search_Engine_Rankings
Expertrating (2012).XML Tutorial: Embedding XML in HTML. Retrieved 7 Feb. 2012 from:http://www.expertrating.com/courseware/XMLCourse/XML-Embedding-HTML-8.asp
Gigee, G.rant (2006).MARC and MARCXML. Retrieved 5 Nov. 2011 from:http://threegee.files.wordpress.com/2006/05/marcxml.pdf
Gill, T.oney (2008).Metadata and the Web: Introduction to Metadata. Retrieved 5 Nov. 2011 from:http://www.getty.edu/research/publications/electronic_publications/intrometadata/metadata.pdf
Google (2012).Web Master Tools: Meta tag. Retrieved 5 Feb. 2012 from:http://www.google.com/support/webmasters/bin/answer.py?answer=79812
Henshaw, R.obin; Valauskas, E.dward J. (2001).Metadata as a Catalyst: Experiments with Metadata and Search Engines in the Internet Journal, First Monday. Retrieved 14 Dec. 2011 fromwww.librijournal.org/pdf/1999-3pp125-131.pdf
Kelly (2006).MARC and MARC XML. Retrieved 14 Dec. 2011 from http://threegee.files.wordpress.com/2006/05/marcxml.pdf
Kyrnin, J.ennifer (2012). What Is Pre-Formatted Text?.Retrieved 14 Dec. 2012from:http://webdesign.about.com/od/htmltags/f/blfaqpre.htm
Lewis, E.dward (2008). Top Ten Search Engines.Retrieved 14 Dec. 2011 from http://www.seoconsultants.com/search-engines/
Luk, R.obert; Chan, A.lvin; Dillon, T.haram; Leong, H. V. (2000).A Survey of Search Engines for XML Documents. Retrieved 14 Dec. 2011 from: http://www.haifa.il.ibm.com/sigir00-xml/final-papers/Luk/XMLSUR.htm
Luk, R.obert; Leong, H. V.; Dillon, T.haram S.; Chan, A.lvin T. S.; Croft, W. B.ruce; Allan, J.ames (2002). A Surveyin Indexing and Searching XML Documetns.Retrieved 14 Dec. 2011 from: http://onlinelibrary.wiley.com/doi/10.1002/asi.10056/full
Mascaro, Michelle J (2004). The Value of Flexibility in Metadata Schemas.Retrieved 14 Dec. 2011 from: http://etd.ils.unc.edu:8080/dspace/bitstream/1901/57/1/michellemascaro.pdf
Mohamed, Kh.aledA.f. (2006). The impact of metadata in web resources discovering.Online Information Review. 30 (2): 155-167
Qin, J.ian (2000). Representation and Organization of Information in the Web Space: From MARC to XML. Retrieved 12 Dec. 2011 from: http://inform.nu/Articles/Vol3/v3n2p83-88.pdf
Quevedo-Torrero, J.esus U.baldo (2004). IMPROVING WEB RETRIEVAL BY MINING THE HTML TAGS FOR KEYWORDS AND EXPLORING THE HYPERLINK STRUCTURES WEB PAGES.Ph. D. Dissertation, Department of Computer Science, University of Houston. Retrieved 12 Dec. 2011 from: ‎http://wwwlib.umi.com/dissertations/fullcit/3156028‎
Safari, M.ehdi (2005).Search Engine and Resource Discovery on the Web: Is Dublin Core an Impact Factor. Retrieved 12 Dec. 2011 from: www.webology.ir/2005/v2n2/a13.html
Search Engine Watch (2007). How Search Engines Work. Retrieved 12 Dec. 2011 from: http://searchenginewatch.com/article/2065173/How-Search-Engines-Work
Sharif, A. (2007), “Study the effectiveness of metadata elements on web page visibility in publicsearch engines”, available at: http://eprints.rclis.org/handle/10760/9171#.UHPNcVG94hA (accessed December 7, 2012) (in Persian).
Sokvine, L.loyd (2000). An Evaluation of the Effectiveness of Current Dublin Core Metadata for Retrieval. Retrieved 12 Dec. 2011 from: www.vala.org.au/vala2000/2000pdf/Sokvitne.PDF
Tabatabai Amiri F, Taheri S. M., Farajpahlou A, Osareh F, Moarrefzadeh A. “Web Search Engines and Indexing and Ranking the Content Object Including Metadata Elements Available at the Dynamic Information Environments”, Journal of Information Processing and Management. 2012; 27 (4) (in Persian).
Taheri, S. M. (2008). “A comparative study on the efficiency of Dublin Core Metadata Initiative and MARC21 Metadata Format in organizing information available on the World Wide Web”, Library and Information Quarterly. No.: 43 (in Persian).
Taheri, S. M.; Hariri, N.adjla (2012).A Comparative Study on the Indexing and Ranking of the Content Objects Including the MARCXML and Dublin Core's Metadata Elements by General Search Engines. Electronic Library.Vol 30, issue 4
Taheri, S.M. and Hariri, N. (2012), “A comparative study on the indexing and ranking of the content objects including the MARCXML and Dublin Core’s metadata elements by general search engines”, Library and Information Quarterly. No.: 48 (in Persian).
Taheri, S.M., Hariri, N. and Fattahi, S.R. (2014), “Interoperability between metadata systems and web search engines: Current Development and Trends”, Library and Information Research Journal 3 (2) (in Persian).
Turner, T.homas P.; Brackbill, L.ise (1998).Rising to the Top: Evaluating the Use of the HTML META Tag To Improve Retrieval of World Wide Web Documents through Internet Search Engines Retrieved 12 Dec. 2011 from: http://cat.inist.fr/?aModele=afficheN&cpsidt=1748620
Woychowsky, E.dmond (2003). XML data islands offer a useful mechanism to display Web form: data. Retrieved 12 Dec. 2011 from: http://www.techrepublic.com/article/xml-data-islands-offer-a-useful-mechanism-to-display-web-form-data/1058668
Yahoo (2012).What are meta tags?. Retrieved 12 Feb. 2012 from: http://help.yahoo.com/l/us/yahoo/smallbusiness/promotion/meta/meta-01.html.
Zhang, J.in; Dimitroff, A.lexandra (2004).Internet search engine's response to metadata Dublin Core implementation. Retrieved 12 Dec. 2011 from http://portal.acm.org/citation.cfm?id=1142111
Zhang, J.in; Dimitroff, A.lexandra (2005a).The impact of metadata implementation on Webpage visibility in search engine result (Part II). Retrieved 12 Dec. 2011 from: http://www.sciencedirect.com/science?_ob=ArticleURL&_udi=B6VC8-4BHCBX4-2&_user=10&_rdoc=1&_fmt=&_orig=search&_sort=d&view=c&_acct=C000050221&_version=1&_urlVersion=0&_userid=10&md5=a853d410a866732d3f8ab5dd3217d412
Zhang, J.in; Dimitroff, Al.exandra (2005b).The impact of Webpage content characteristics on webpage visibility in search engineresult (Part I).Retrieved 12 Dec. 2011 from: http://web.simmons.edu/~braun/467/part_1.pdf
CAPTCHA Image