CSC 379 SUM2008:Week 2, Group 3: Difference between revisions

From Expertiza_Wiki
Jump to navigation Jump to search
Line 49: Line 49:
==External Links==
==External Links==
[http://www.infotoday.com/searcher/may01/liddy.htm How a Search Engine Works]
[http://www.infotoday.com/searcher/may01/liddy.htm How a Search Engine Works]
[http://realtytimes.com/rtpages/20010807_searchads.htm Search Engine Ads Stir Controversy]

Revision as of 21:30, 18 July 2008

Search Engines

Search engines fill an important role in our lives, helping us locate information within a wide array of multimedia. However many ethical considerations are involved in their operation; the ordering of rankings, the range of content indexed (or not), and how advertisements are incorporated, are a few. Broadly examine the ethics of search engine operation and use.

Function

A search engine is an information retrieval system that match queries with an index it creates. Search engines consist of four essential modules:

  1. Document Processor - this prepares, processes, and inputs the documents, pages, or sites that users search against.
  2. Query Processor - this consist of seven possible steps
    • Tokenizing usually by breaking inputs into strings separated by white space.
    • Parsing operators like reserved punctuation or reserved terms in specialized format (e.g., AND, OR). This may also include boolean, adjacency, or proximity operators.
    • Stop list and stemming might contain words from commonly occurring querying phrases. Engines may drop these two steps.
    • Creating the query depends on the method used to do the matching.
    • Query expansion employs synonyms to optimize the search results.
    • Query term weighting is used to judge the importance of each term in the query.
  3. Search and Matching Function - this is based on which theoretical model of information retrieval underlies the system's design philosophy.
  4. Ranking Capability - this is done several ways
    • Term frequency
    • Location of terms
    • Link analysis
    • Popularity
    • Date of Publication
    • Length
    • Proximity of query terms
    • Proper nouns

Issues

Algorithms

The algorithms that were initially used were fairly basic and objective. They were based such criterion as number of visits to a page or the number of pages that link to a certain page. The success and popularity of Google is attributed to their more subjective approach. Google implemented an algorithm that is meant to find what users are looking for instead of offering the most popular results, which is not always the same thing. The secrecy of their algorithm is what gives Google their competitive edge.

Advertisements

Including advertisements on a search engine is nothing new. For years, these ads had no affect on search results. Advertising practices include:

  • Paid placement is advertising that is outside of the editorial content of the search results, sometimes above or below the editorial content, or in a sidebar.
  • Paid inclusion is advertising within the editorial content of the search results, though it does not necessarily guarantee a certain position within the results.
  • Paid submission is the practice of requiring payment to speed up the processing of a listing, though it rarely guarantees that a site will in fact be listed by the search engine.

The inclusion of advertisements in search results has given rise to many fair use and comparative advertising issues.

Politics

Suppressed Content

External Links

How a Search Engine Works Search Engine Ads Stir Controversy