Here is example output from the sample solution when run on attu. (There is reason to believe that answers may vary slightly across platforms.) The '>' at "1500 pages..." is a lexer bug. We're not going to let little bugs bother us in this assignment.
You can run the sample executable with no setup required.
ssh to attu, cd /cse/courses/cse303/05au/hw5, and then give the command shown below. (Note that the command has two arguments.)
$ ./hw5soln data www.nytimes.com/index.html
100 pages...
200 pages...
300 pages...
400 pages...
500 pages...
600 pages...
700 pages...
800 pages...
900 pages...
1000 pages...
1100 pages...
1200 pages...
1300 pages...
1400 pages...
>1500 pages...
1600 pages...
1700 pages...
1800 pages...
Done: 1898 pages, 1464405 words, 161944 links scanned
54432 distinct words
Enter single keyword query> jello
1 matches
1 www.nytimes.com/2005/10/21/arts/music/21pop.html
Enter single keyword query> bizarre
22 matches
1 www.nytimes.com/2005/10/20/arts/dance/20lees.html
1 www.nytimes.com/2005/10/20/fashion/thursdaystyles/20CRITIC.html
1 www.nytimes.com/2005/10/21/nyregion/metrocampaigns/21clinton.html
1 www.nytimes.com/2005/10/22/arts/television/22sher.html
1 www.nytimes.com/2005/10/23/books/chapters/1023-1st-levy.html
1 www.nytimes.com/2005/10/23/books/review/1023bb-paperback.html
1 www.nytimes.com/2005/10/23/books/review/23miller.html
1 www.nytimes.com/2005/10/23/theater/newsandfeatures/23mill.html
1 www.nytimes.com/2005/10/25/arts/music/25bare.html
1 www.nytimes.com/2005/10/26/opinion/26wed2.html
1 www.nytimes.com/2005/10/30/books/bestseller/1030bestchildren.html
1 www.nytimes.com/packages/html/sports/year_in_sports/01.21.html
1 www.nytimes.com/packages/html/sports/year_in_sports/07.24.html
1 www.nytimes.com/packages/html/sports/year_in_sports/09.20.html
3 www.nytimes.com/packages/html/sports/year_in_sports/10.30.html
1 www.nytimes.com/packages/html/sports/year_in_sports/11.20.html
1 www.nytimes.com/packages/html/sports/year_in_sports/12.23.html
1 www.nytimes.com/pages/health/psychology/index.html
1 www.nytimes.com/pages/health/psychology/text/index.html
1 www.nytimes.com/ref/books/nonfictionf.html
Enter single keyword query> q
$
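A note on reading the query results: in the sample output, the "matches" figure appears to be the sum of the per-page counts rather than the number of pages. For "bizarre", 19 pages with one occurrence plus one page with three occurrences gives 22, spread over 20 pages, and the pages appear in alphabetical order by URL. The sketch below illustrates that interpretation with a tiny in-memory index. It is written in Python purely for illustration and is not the sample solution's code; the function names `build_index` and `query`, the whitespace tokenization, and the example.com pages are all hypothetical.

```python
def build_index(pages):
    """Map each word to a dict of {page_url: occurrence count}.

    `pages` maps a URL to its text. Words are split on whitespace and
    lowercased, which is an assumption and not the real lexer's behavior.
    """
    index = {}
    for url, text in pages.items():
        for word in text.lower().split():
            per_page = index.setdefault(word, {})
            per_page[url] = per_page.get(url, 0) + 1
    return index


def query(index, keyword):
    """Return (total occurrences, [(count, url), ...]) ordered by URL."""
    per_page = index.get(keyword.lower(), {})
    rows = [(count, url) for url, count in sorted(per_page.items())]
    total = sum(count for count, _ in rows)
    return total, rows


if __name__ == "__main__":
    # Hypothetical pages, not taken from the crawled data.
    pages = {
        "example.com/a.html": "bizarre news and more bizarre news",
        "example.com/b.html": "a bizarre story",
    }
    total, rows = query(build_index(pages), "bizarre")
    print(f"{total} matches")
    for count, url in rows:
        print(count, url)
```

Here the total is 3 even though only two pages contain the word, which mirrors the 22-versus-20 relationship in the sample output.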