Searching the WWW
Locating the right information on the WWW requires effort

Looking In the Right Place
The WWW is not the first place to look
Go directly to a site -- www.irs.gov
Go to right sight -- dictionary.cambridge.org
Go to the library -- www.lib.washington.edu
Go for the kind of information you want -- www.npr.org
Ask, “What site provides this information?”

Search Engines
No one controls what’s published on the WWW ... it is totally decentralized
To find out, search engines crawl Web
Two parts
Crawler visits Web pages building an index of the content
Query processor checks user requests against the index, reports on known pages

Google Advanced

Boolean Queries
Search Engine words are independent
Words don’t have to occur together
To be explicit about occurrences use Boolean queries and quotes
Logical Operators: AND, OR, NOT
monet AND water AND lilies
“van gogh” OR gauguin
 vermeer AND girl AND NOT pearl

Queries
Searching strategies …
Limit by top level domains or format
Find terms most specific to topic
Look elsewhere for key words, e.g. bio
Use exact phrase only when universal
If too many hits, requery
“Search within results” using “-”
Once found, ask if site is best source

Truth on the Web
Much Web information is wrong
Using the Web effectively means recognizing quality information
Information from reliable organizations is usually preferred -- check out ownership
Look for accuracy, currency, …
Follow links to verify that the content supports the original page

A Bogus Site

True Site, Bogus Implication

Project 1
Create a bogus (fictitious) Web page
To appreciate how easy it is fake quality info you will build a bogus Web page
Modify photograph, changing its meaning
Write misleading text
Add “authenticity” links, fake credentials ...
Your page should look as legitimate as possible, but contain false information

Project 1 Details on Web
You will …
Write page in HTML
Find/take a photo
Modify w/ Photoshop
Find “authenticity” links
Milestones
Turn In January 23: Web page plans
Turn In January 30: Web page + photo

Here’s How It Works
Follow these steps:
Create your page locally
When finished, publish it on the server
Do not touch after the deadline (11:30PM)
Print off a copy and turn it in in class
Leave your page unchanged until grading is complete