Results 1 to 11 of 11
  1. #1
    Join Date
    Feb 2001
    Location
    West Michigan, USA
    Posts
    9,675

    Wayback Machine (archive.org) being sued

    ||| 99.999% Uptime SLA!!!
    Plenty of space and bandwidth to fit your needs!
    www.AEIandYou.com - - (WP Friendly - Premium Reseller Hosting and Cheap Dedicated Servers)

  2. #2
    Join Date
    Feb 2002
    Location
    Australia
    Posts
    24,009
    Holey Moley Batman.
    AussieHost.com Aussie Bob, host since 2001
    Host Multiple Domains on Fast Australian Servers!!

  3. #3
    Join Date
    Feb 2001
    Location
    West Michigan, USA
    Posts
    9,675
    I love archive.org - visit it often. Actually grabbed a pre-oops copy of one of my webpages from it recently. That said, I always wondered how they got around copyright issues?

    --Tina
    ||| 99.999% Uptime SLA!!!
    Plenty of space and bandwidth to fit your needs!
    www.AEIandYou.com - - (WP Friendly - Premium Reseller Hosting and Cheap Dedicated Servers)

  4. #4
    Join Date
    Feb 2002
    Location
    Australia
    Posts
    24,009
    I love archive.org too, although it can be a tad emabrrassing, seeing some sites you built 5 years ago. Crikey!
    AussieHost.com Aussie Bob, host since 2001
    Host Multiple Domains on Fast Australian Servers!!

  5. #5
    Join Date
    Sep 2002
    Location
    Western Canada
    Posts
    1,888
    I'm surprised it took this long.

  6. #6
    Join Date
    Nov 2002
    Location
    WebHostingTalk
    Posts
    8,878

    Re: Wayback Machine (archive.org) being sued

    Originally posted by AH-Tina
    http://yro.slashdot.org/article.pl?s...27236&from=rss

    Criminy!

    --Tina
    <sigh> Sad....

    Sirius
    I support the Human Rights Campaign!
    Moving to the Tampa, Florida area? Check out life in the suburbs in Trinity, Florida.

  7. #7
    Archive.org is nice and all but, if they didn't follow the robots.txt files then I believe they are in the wrong there...
    Let Google know when your dead | Read More >>
    Submit Your Press Release, Article, or News Tip Today
    W3Reports - News for Webmasters

  8. #8
    Join Date
    Oct 2004
    Location
    San Francisco, CA
    Posts
    2,454
    Originally posted by Trifolic
    Archive.org is nice and all but, if they didn't follow the robots.txt files then I believe they are in the wrong there...
    Oh but they do!
    http://web.archive.org/web/*/http://vbulletin.com

    Then take a look at their Robots.txt
    http://www.vbulletin.com/robots.txt
    Tyler Cole
    Eeek, a Blog

  9. #9
    Originally posted by Tyler
    Oh but they do!
    http://web.archive.org/web/*/http://vbulletin.com

    Then take a look at their Robots.txt
    http://www.vbulletin.com/robots.txt
    But on at least two dates in July 2003, the suit states, Web logs at Healthcare Advocates indicated that someone at Harding Earley, using the Wayback Machine, made hundreds of rapid-fire requests for the old versions of the Web site. In most cases, the robot.txt blocked the request. But in 92 instances, the suit states, it appears to have failed, allowing access to the archived pages.

    http://www.nytimes.com/2005/07/13/te...rssnyt&emc=rss

    So apparently you can brute force your way into IA archives...

    Either way, just because Wayback denies access, doesn't mean they followed the robots.txt file if they still have it archived somewhere...but blocked to the public..
    Let Google know when your dead | Read More >>
    Submit Your Press Release, Article, or News Tip Today
    W3Reports - News for Webmasters

  10. #10
    Join Date
    Sep 2000
    Location
    Alberta, Canada
    Posts
    3,109
    Although it's true the site does follow the rules laid out in your 'robots.txt' file, we've been blocking them for years, it blew me away when I found out they were stealing our images/content to display on 'their' Website.

    When someone, for whatever reason, takes images/content from your site without asking or even letting you know about it, that is called theft. Perhaps if they had been a bit more polite about what they wanted to do it could have turned out good for everyone.
    PotentProducts.com - for all your Hosting needs
    Helping people Host, Create and Maintain their Web Site
    ServerAdmin Services also available

  11. #11
    Join Date
    Nov 2002
    Posts
    4,377
    When someone, for whatever reason, takes images/content from your site without asking or even letting you know about it, that is called theft. Perhaps if they had been a bit more polite about what they wanted to do it could have turned out good for everyone.
    Every visitor to a webpage takes images and content and stores them - without asking - on their PC. (actually they don't even take them - they make a request and your webserver sends them)

    Isn't robots.txt just a request without any legal status?
    If you put a webpage on the web and it isn't password protected then it's fair game...within copyright laws of course.

Posting Permissions

  • You may not post new threads
  • You may not post replies
  • You may not post attachments
  • You may not edit your posts
  •