Tools: Team Two Data
- 1 Search frequency data in academic literature
- 2 Historians' search behaviour on the web in academic literature
- 3 Number of search terms used in MarineLives wiki search box
- 4 Type of search terms used in MarineLives wiki search box
- 5 List of search terms used in MarineLives wiki search box
- 6 Comparing Google search constrained to MarineLives and MarineLives in-wiki search engine
This page is a repository of data generated by Team Two of the MarineLives Digital Pop Up Lab, which is exploring how historians approach historical search when they are looking for people, places and dates.
We monitor usage of the MarineLives wiki using Google Analytics. Data from Google Analytics enable us to analyse the specific pages viewed by users within the wiki, but not the identity of the users. Google Analytics also captures the searches that users conduct with the wiki's own search box.
In the period June 3rd 2016 to July 3rd 2016, MarineLives wiki users performed 275 searches.
We have analysed these data for (1) the number of search terms used in single searches and (2) the types of content searched for in single searches.
Our conclusion is that "simple searches" using one or two search terms in a single search dominate, and that persons are the type of content most often searched for. We are interested in comparing these results with results from other history-oriented, content-based websites.
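The first analysis, counting search terms per search, can be sketched in a few lines of Python. This is a minimal illustration rather than the method used by Team Two: it assumes a "term" is any whitespace-separated token, and the function names `count_terms` and `term_count_distribution` are hypothetical. The sample queries are copied from the list further down this page and are not representative of all 275 searches.

```python
from collections import Counter

# A handful of queries copied from the list on this page (not the full log).
sample_queries = [
    "captain robert morris",
    "peter and john",
    "Ross keel amsterdam",
    "oliver tanner anglo dutch war",
]


def count_terms(query: str) -> int:
    """Count the whitespace-separated terms in one search (assumed definition)."""
    return len(query.split())


def term_count_distribution(queries):
    """Map 'number of terms' to 'number of searches with that many terms'."""
    return Counter(count_terms(q) for q in queries)


distribution = term_count_distribution(sample_queries)
for n_terms in sorted(distribution):
    print(f"{n_terms} term(s): {distribution[n_terms]} search(es)")
```

A different definition of "term" (for example, treating an archival reference such as hca 13/70 f.427 as a single term) would change the counts, so the definition should be stated alongside any published figures.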
Search frequency data in academic literature
Anne Aula, Rehan M. Khan and Zhiwei Guan (2010) survey the literature on search term frequency. They report, based on search logs, that the average number of query terms is between 2.35 and 2.6 per query. There is some evidence to suggest that smartphone users enter a slightly higher number of terms in phone-based queries. They report that most queries are simple keyword queries, with only about 10% of queries containing advanced query operators. They suggest that there are significant regional differences, with US-based searchers making greater use of advanced query operators. Citing Eastman and Jansen (2003), they suggest that most advanced query operators do not increase the precision of the query, so they may not be worth the trouble. Other researchers suggest that search engine users typically evaluate the results of a specific search quickly, before either clicking on a result or refining their query, with an average of 7.78 seconds reported by Granka, Joachims and Gay (2004).
Historians' search behaviour on the web in academic literature
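To set a search log beside the figures reported in this literature, two summary numbers are needed: the mean number of terms per query and the share of queries containing an advanced operator. The sketch below shows one possible way to compute them. The definition of "advanced operator" used here (a quoted phrase, an upper-case AND, OR or NOT, or a leading + or -) is an assumption for illustration and is not taken from the cited papers; `uses_operator` and `summarise` are hypothetical names.

```python
import re
from statistics import mean

# Assumed, illustrative definition of an "advanced query operator":
# a quoted phrase, an upper-case boolean keyword, or a term prefixed by + or -.
OPERATOR_PATTERN = re.compile(r'"[^"]+"|\b(?:AND|OR|NOT)\b|(?<!\S)[+-]\w')


def uses_operator(query: str) -> bool:
    """Return True if the query contains one of the assumed operators."""
    return bool(OPERATOR_PATTERN.search(query))


def summarise(queries):
    """Return (mean terms per query, share of queries using an operator)."""
    lengths = [len(q.split()) for q in queries]
    operator_share = sum(uses_operator(q) for q in queries) / len(queries)
    return mean(lengths), operator_share


# Sample queries copied from the list on this page (not the full log).
sample_queries = [
    'captain "robert morris"',
    "captain robert morris",
    "peter and john",
    "hca 13/65 f.87r",
]

mean_terms, operator_share = summarise(sample_queries)
print(f"mean terms per query: {mean_terms}")
print(f"share using operators: {operator_share:.0%}")
```

The lower-case "and" in peter and john is deliberately not counted as an operator, since it is part of a name rather than a boolean keyword.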
Duff, Wendy M. and Catherine A. Johnson, Accidentally Found on Purpose: Information-Seeking Behaviour of Historians in Archives, The Library Quarterly: Information, Community, Policy, vol. 72, no. 4 (Oct. 2002), pp. 472-496
Foster, Allen and Nigel Ford, Serendipity and information seeking: an empirical study, Journal of Documentation, vol. 59, no. 3 (2003), pp. 321-340
Number of search terms used in MarineLives wiki search box
Type of search terms used in MarineLives wiki search box
List of search terms used in MarineLives wiki search box
The following list contains the search terms used in 275 searches performed using the MarineLives wiki search box, June 3rd 2016 - July 3rd 2016:
Capitalisation and spacing are reproduced as used in the search box.
Search terms in inverted commas reflect the use of inverted commas in the original search.
blessing ribert morris
captain "robert morris"
captain robert morris
hca 13/65 f.87r
hca 13/68 f.145v
hca 13/70 f.427
hca 12/70 f.427v
hca 13/70 f.711r
hca 13/72 f.134v
Hca 13/73 f.711r
Hca 13/73 f.711r
Hca 13/73 f.711v
HCA 13 73 f.711r
oliver tanner anglo dutch war
peter and john
PROB 11/361 King 125-176
PROB 32/6/8 Deceased: Bushell. Benjemen. Stepney. Middx Inventory
Ross keel amsterdam
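Several of the searches above are the same archival reference typed in different ways (Hca 13/73 f.711r and HCA 13 73 f.711r, for instance). When classifying searches by type, it can help to normalise such variants first so that they are counted together. The sketch below is one possible approach, not the team's procedure. The reference pattern is inferred only from the queries listed above, and `normalise_hca` and `classify` are hypothetical helper names.

```python
import re
from collections import Counter

# Assumed shape of an HCA folio reference, inferred from the listed queries:
# "hca", two numbers separated by "/" or a space, then "f." plus a folio
# number and an optional r (recto) or v (verso).
HCA_REFERENCE = re.compile(
    r"^\s*hca\s+(\d+)[\s/]+(\d+)\s+f\.?\s*(\d+)([rv]?)\s*$",
    re.IGNORECASE,
)


def normalise_hca(query: str):
    """Return a canonical 'HCA 13/73 f.711r' style string, or None if no match."""
    match = HCA_REFERENCE.match(query)
    if match is None:
        return None
    first, second, folio, side = match.groups()
    return f"HCA {first}/{second} f.{folio}{side.lower()}"


def classify(query: str) -> str:
    """Very coarse two-way split; other reference styles (e.g. PROB) fall in 'other'."""
    return "archival reference" if normalise_hca(query) else "other"


# Sample queries copied from the list above.
sample_queries = [
    "Hca 13/73 f.711r",
    "Hca 13/73 f.711r",
    "HCA 13 73 f.711r",
    "hca 13/70 f.427",
    "captain robert morris",
]

reference_counts = Counter(
    ref for ref in map(normalise_hca, sample_queries) if ref is not None
)
for reference, count in reference_counts.most_common():
    print(reference, count)
```

Distinguishing persons, places and dates among the remaining searches would need a name list or manual coding, which this sketch does not attempt.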
Comparing Google search constrained to MarineLives and MarineLives in-wiki search engine
The use of inverted commas around a multi-term search query in the MarineLives wiki will force a search for the exact phrase so marked up.
For example, "Price of Pepper" will only return results with the phrase Price of Pepper, and will omit results that contain only the separate words Price, of and Pepper.
The same is true of Google search.
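The difference between an exact-phrase search and a search on separate terms can be illustrated with a toy matcher. This is a simplified model for explanation only; it does not reproduce how the wiki search engine or Google actually index or rank pages. Case-insensitive matching and the function names are assumptions.

```python
import re


def tokens(text: str):
    """Split text into lower-case word tokens."""
    return re.findall(r"\w+", text.lower())


def contains_phrase(text: str, phrase: str) -> bool:
    """True if the words of the phrase occur in the text, adjacent and in order."""
    words, target = tokens(text), tokens(phrase)
    n = len(target)
    return n > 0 and any(
        words[i:i + n] == target for i in range(len(words) - n + 1)
    )


def contains_any_term(text: str, query: str) -> bool:
    """True if at least one word of the query occurs anywhere in the text."""
    words = set(tokens(text))
    return any(term in words for term in tokens(query))


# Invented example sentences, used only to show the two behaviours.
page_a = "They disputed the price of pepper at Bantam"
page_b = "The pepper was sold at a high price"

print(contains_phrase(page_a, "Price of Pepper"))    # phrase present
print(contains_phrase(page_b, "Price of Pepper"))    # words present, phrase absent
print(contains_any_term(page_b, "Price of Pepper"))  # unquoted search still matches
```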
A focussed search can be performed in Google, looking largely at MarineLives wiki pages, by including the term "MarineLives" in inverted commas. The results will come almost entirely from Google-indexed pages of the MarineLives wiki, but will also include relevant results from the MarineLives WordPress blog The Shipping News and any third-party pages which contain the word MarineLives, but not Marine Lives.
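For repeated comparisons, the Google query can be assembled programmatically by appending "MarineLives" in inverted commas to the terms used in the wiki search box. The sketch below builds the query string and a standard Google search URL; `google_query` and `google_search_url` are hypothetical helper names, and the code only constructs the URL without fetching any results.

```python
from urllib.parse import urlencode


def google_query(search_terms: str, marker: str = "MarineLives") -> str:
    """Append the quoted marker word to the original search terms."""
    return f'{search_terms} "{marker}"'


def google_search_url(search_terms: str) -> str:
    """Build a Google search URL for the constrained query (no request is made)."""
    return "https://www.google.com/search?" + urlencode(
        {"q": google_query(search_terms)}
    )


wiki_terms = '"Price of Pepper"'
print(google_query(wiki_terms))
print(google_search_url(wiki_terms))
```

The same `wiki_terms` string can then be typed unchanged into the wiki search box, so that both engines receive identical terms apart from the added marker.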
The examples below compare the results generated by querying the MarineLives wiki and querying Google, using identical search terms, but with the addition of "MarineLives" to the Google query.
- Aula, Anne, Rehan M. Khan and Zhiwei Guan, How does Search Behaviour Change as Search Becomes More Difficult?, CHI, Atlanta, Georgia, April 10-15, 2010, viewed 05/07/2016
- Eastman, C.M. and Jansen, B.J. (2003) Coverage, relevance, and ranking: the impact of query operators on web search engine results. ACM Transactions on Information Systems, 21 (4), 383-411
- Granka, L.A., Joachims, T. and Gay, G. (2004) Eye-tracking analysis of user behaviour in WWW search. Proc. SIGIR 04, 478-479