The past two weeks I had a “presentation skills training” where we had to choose a topic that was completely new to us. Because I was quite interested in looking at a tool most of us use every day, I decided to take a closer look at the Anatomy and Workings of Search Engines. Since one of those search engines has become a common verb in most languages, let’s start from the Oxford English Dictionary’s definition of “to google”:
1. intr. To use the Google search engine to find information on the Internet.
2. trans. To enter (a search term) into the Google search engine to find information on the Internet; to search for information about (a person or thing) in this way.
Since people use Google and other search engines regularly, they tend to rely on four common assumptions as identified by Bettina Fabos in 2003:
- Search engines are impartial information tools.
- Search engines search the entire Web, gleaning the most relevant results.
- Search engines vary greatly, thus offering choice and a competitive marketplace.
- Search engines are the only place to go for relevant information on the web.
Before I can contradict these four assumptions, it is important to understand the structure of the search engine industry.
Search Industry Structure
LESSON 1: Search Engines are not impartial, they are part of an industry.
LESSON 2: Search engines do not search the entire web, especially not the deep web. Some directories only include paid-for content.
Google’s initial strategy was selling their technology as a search engine provider not only to other search engines, but also to other websites that use Google’s technology to power search on their own website. This strategy brings in some money, but it’s not a continuous flow of cash. Therefor, some other search engines chose to focus more on marketing agencies trading for space thus including sponsored links. Within the search industry structure, it is clear that the best way to appear in search results over different search engines, is to pay in order to add (commercial) content to the repository. However, in order to determine the efficiency of advertisements, search engines and marketeers have agreed to a pay-by-performance strategy, which means advertisers only pay the search engine when someone actually clicks on a link. Other search engines have gone for more aggressive strategies such as paid inclusion, where the advertisement appears in every search, but had to sacrifice users because of it.
LESSON 3: There is little competition in the marketplace, with mainly American companies dominating the search industry.
The search engine industry used to be dominated by three main companies: Google, Yahoo!, and Microsoft. Add to that the current trends in voice search and as this infographic beautifully demonstrates. Amazon might not be the richest company, but they certainly have the first-mover advantage and are quickly connecting their Alexa voice assistant to cars and home devices, effectively taking over parts of our lives.
This beautiful infographic also shows the challenges related to monitising voice search. Noble as the goals of Google may be in its promises of search engine integrity, the main thing that counts at the end of the day, is money. The reason Amazon holds such a large segment of the market, could be due to the connection with their online store possibilities and connections to apps and services such as music streaming services.
LESSON 4: There are other places to go for information other than search engines. - perhaps the library? -
Independent institutions such as the university, but also public libraries should warrant access to knowledge that does not necessarily have a commercial value. Therefore, these public institutions need to counteract the commercial interests of search engines by focussing on Open Access. In this regard, the Digital Humanities field is one of the first and most important advocates for Open Access. It is this years main theme of the Alliance of Digital Humanities conference in Montréal. Another important yet political and legal institution working on Open Access is the Open Access Infrastructure for Research in Europe or OpenAIRE. Preferably Open Access information is accessible through Google, but we also need to think about back-ups and other ways of disseminating information.