Empirical study on crawler visibility of PDF documents in digital libraries.
Proceedings of The Third IEEE International Conference on Computer Science and Information Technology (ICCSIT 2010). 10-13 July. Chengdu, China. 373-379.
Weideman, M. 2010. Empirical study on crawler visibility of PDF documents in digital libraries. Proceedings of The Third IEEE International Conference on Computer Science and Information Technology (ICCSIT 2010). 10-13 July. Chengdu, China. 373-379. Online: http://web-visibility.co.za/website-visibility-digital-library-seo/
Digital library users might not enter a digital library through homepage menus. As a result, digital library owners should consider the visibility to search engines of stored PDF documents. The aim of this research project was to determine to what extent the visibility of these PDF documents can be improved.
In a series of empirical experiments, 100 PDF documents stored on digital libraries were identified and inspected. Searches were run for each document, and its rankings on search engine result pages were recorded. The current visibility of these documents was then calculated. After the documents were submitted to Google, a waiting period was allowed for crawler visitation, and the searches were repeated.
The results of these experiments proved that the visibility of these documents could be improved only marginally. It is therefore concluded that the designers of university digital libraries should consider alternatives, such as providing text extracts of PDF documents, to enhance the overall visibility of content.