Tuesday, February 2, 2010

HW4 is out!

This is a fun one (I think) ... so I hope you enjoy it. Grab it from the course web page.

Like I said in class, be sure that you start early because if you don't you'll be in at a big disadvantage for the "gaming google" problem.  And, if you're having trouble finding a group of 2-3 people for the "gaming google" problem, just post a comment here and coordinate with people that way.  I'd like everyone to have a partner for it, but if you really can't find one feel free to do it alone.  I'm really hoping that you come up with some creative ways to get ranked highly, so think outside of the box....and talk to me and the TAs if you're stuck.

10 comments:

  1. Hey everyone,

    Since one of the primary goals of the gaming google problem is to simply beat the TAs, I'm thinking it would be a good idea for all of the students to link to the other sites made by students (but not the sites made by TAs). This should give all of the students an advantage over the TAs and make it more a competition amongst us only. Post here if you're interested so that we can organize something, I guess.

    ReplyDelete
  2. Also, some legitimate questions on the homework:

    For googlewhacking, the dictionary listed (Merriam Webster) has an unabridged version whose definitions we can't see, but are told exist. Can we still use those words? In addition, the dictionary includes a number of other "words" which are, in actuality, proper nouns (e.g. city names). Do those count?

    ReplyDelete
  3. To Jon: You can use the unabridged words as well as well as proper nouns. We only ask that your words should have a dictionary entry to prevent you from making up your own words :-)

    ReplyDelete
  4. It seems that at least two of the CMUers are planning to try to maintain there position at the top of "rankmaniac" ... and have even added a "rankmaniac 2010" to their pages!

    ReplyDelete
  5. This comment has been removed by the author.

    ReplyDelete
  6. For the Google whack, I found a query that returned only one result, but at the bottom it says:

    "In order to show you the most relevant results, we have omitted some entries very similar to the 1 already displayed. If you like, you can repeat the search with the omitted results included."

    Do we care about these very similar results? In my case, the similar result is the exact same webpage indexed under the IP address instead of its domain.

    ReplyDelete
  7. To Mariand: Your result would be fine if you can see "Results 1 - 1 of ..."

    ReplyDelete
  8. For problem 2, are we supposed to use the PageRank system with P or G?? (G is the one that taxes each node and adds edges between any two nodes.)

    ReplyDelete
  9. To Joyoung: Use the one with G. That system is guaranteed to produce unique pageranks with no assumptions on the underlying web graph.

    ReplyDelete
  10. Google doesn't want to crawl my website :(
    If anyone can help and add my link to your website:
    http://sites.google.com/site/rankmaniac2010caltech/home
    I can also add link to your website, let me know
    thanks

    ReplyDelete