fotolog archiver

Download Latest (1.9)

news

2007/09/30
version 1.9 is here!
  • this version is much cleaner, thanks to fotolog's html cleanup. they're now serving much cleaner html pages, which made my job easier. if you've been having problems archiving your fotolog with 1.8, it was because of this change in their html. the problem should be fixed now.

    what's new in 1.8:

  • fixed guestbook download code. the previous version wasn't grabbing the correct guestbook html.
  • corrected styles in the guestbook comments. the downloaded fotolog should now look much more like it does online.
  • filtered out gc user icons. i wasn't able to line these up right so they're just filtered out for now.
  • code clean-up. it should be easier to read and hack by other users.

    known issues:

  • if you happen to run the script at an "inopportune" time, you may download guestbook messages that say something like

    "Wow! Fotolog is unusually busy right now and the servers were unable to load this guestbook. Please try again in a few moments."

    if this happens, just stop the script and run it again a bit later.
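    A minimal sketch of how a downloaded guestbook page could be checked for that placeholder. This is not code from getfotolog.py: the function name and the marker constant are hypothetical, the marker text is taken from the message quoted above, and the sketch is written for current Python 3 rather than Python 2.4.

```python
# Hypothetical helper, not part of getfotolog.py.
# The marker is a fragment of the "busy" message quoted above.
BUSY_MARKER = "Fotolog is unusually busy right now"


def looks_like_busy_page(html):
    """Return True if the guestbook HTML is the 'busy' placeholder page."""
    return BUSY_MARKER in html


if __name__ == "__main__":
    sample = "<p>Wow! Fotolog is unusually busy right now and the servers were unable to load this guestbook.</p>"
    print(looks_like_busy_page(sample))
```

    A check like this could be run over the saved guestbook files after a run to find pages worth downloading again.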

    Sept 9, 2006


    fotolog archiver 1.7 is here! Download it here (you may have to right-click on the link to save it on your computer).

    This version works with fotolog as of Feb. 23, 2006. It still has the following issues, which you can correct yourself after you download the main files:

  • The guestbook scrollbar is back. Does anyone know how to get rid of it? The guestbook is in an iframe currently. One workaround would be to include the guestbook directly in the main html page.
    will, 02/23/2006


    Prerequisites: The script uses the Python 2.4 API. If you don't have Python installed on your system you can get it here.

    If you're on Mac OS X 10.3 then you probably have Python 2.3, which should also work.

    This is a Python script that crawls a fotolog user's account and downloads the HTML pages, the images, and the guestbook entries to the user's computer. The script was tested on Windows 2000 and XP, using Python 2.4.

    Once you have Python 2.4 installed on your system, you can invoke the getfotolog.py script by adding its path after the python executable. For example, on my Windows 2000 computer I would type:

    c:\Python24\python.exe getfotolog.py

    The script will then prompt you for a starting fotolog page. To archive all your fotolog pages, just enter your very first fotolog page. If all goes well the script will download that page, the main photo on the page, and all the guestbook entries, and save them in a directory named after your fotolog user name, inside the directory you ran the script from.

    In addition, the script will follow the 'next >>' link on the page and retrieve the next page, and the page after that, and so on.
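    The link-following step can be sketched roughly as below. This is an illustration in current Python 3, not the code in getfotolog.py (which targets Python 2.4). It assumes the link is a plain <a> tag whose text starts with "next"; the class name, function name, and the example.com URLs are made up for the example.

```python
from html.parser import HTMLParser
from urllib.parse import urljoin


class NextLinkFinder(HTMLParser):
    """Remember the href of the first <a> whose text starts with 'next'."""

    def __init__(self):
        super().__init__()
        self._href = None      # href of the <a> tag currently open, if any
        self._text = []        # text pieces seen inside that <a>
        self.next_href = None  # result: href of the 'next' link

    def handle_starttag(self, tag, attrs):
        if tag == "a":
            self._href = dict(attrs).get("href")
            self._text = []

    def handle_data(self, data):
        if self._href is not None:
            self._text.append(data)

    def handle_endtag(self, tag):
        if tag == "a" and self._href is not None:
            label = "".join(self._text).strip().lower()
            if self.next_href is None and label.startswith("next"):
                self.next_href = self._href
            self._href = None


def find_next_url(page_url, html):
    """Return the absolute URL of the 'next' link, or None if there is none."""
    finder = NextLinkFinder()
    finder.feed(html)
    finder.close()
    if finder.next_href is None:
        return None
    # Resolve relative links against the page they were found on.
    return urljoin(page_url, finder.next_href)


if __name__ == "__main__":
    page = '<a href="page1.html">&lt;&lt; previous</a> <a href="page3.html">next &gt;&gt;</a>'
    print(find_next_url("http://example.com/will/page2.html", page))
```

    A crawl loop would call find_next_url on each downloaded page and stop when it returns None, which is what happens on the most recent page.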

    To view the archived fotolog pages, simply go into the directory named after your fotolog user name and open any .html file in there. The next and previous links have been updated to point to the local copies of your fotolog pages, so you can navigate around.

    Because of the nature of the internet and the fotolog servers, I've put in some delays between page requests so the script won't put too much load on fotolog or on your own computer. The script is also smart enough to detect when fotolog is being difficult, and it will retry. It's actually quite persistent and won't leave you with any blank images or empty guestbooks (unless they were blank originally). You might not be surprised to learn that it's faster to get your pages from fotolog during off-peak hours, so I'd recommend running the script before you go to sleep.
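    The delay-and-retry idea can be sketched as follows. Again this is an illustration in current Python 3 and not the script's actual code: the function name, the attempt count, and the delay are assumptions, and the caller supplies both the download function and the check for a usable response.

```python
import time


def fetch_with_retry(fetch, url, is_good, attempts=5, delay=2.0, sleep=time.sleep):
    """Call fetch(url) until is_good(body) is true or the attempts run out.

    fetch    -- function that downloads url and returns its body
    is_good  -- function that decides whether a body is usable
    attempts -- maximum number of tries (illustrative default)
    delay    -- seconds to wait between tries (illustrative default)
    Returns the body on success, or None if every attempt failed.
    """
    for attempt in range(attempts):
        try:
            body = fetch(url)
        except OSError:
            # Network errors (urllib.error.URLError is an OSError) count as a miss.
            body = None
        if body is not None and is_good(body):
            return body
        if attempt < attempts - 1:
            sleep(delay)  # pause so the server is not hammered
    return None


if __name__ == "__main__":
    tries = []

    def flaky_fetch(url):
        # Fails twice, then succeeds, to imitate a busy server.
        tries.append(url)
        if len(tries) < 3:
            raise OSError("server busy")
        return "<html>ok</html>"

    body = fetch_with_retry(flaky_fetch, "http://example.com/will/page1.html",
                            lambda b: "ok" in b, delay=0.1)
    print(body, len(tries))
```

    Passing the download function and the sleep function in as arguments keeps the retry logic easy to try out without touching the network.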

    Finally the script is protected under the MFJ license. Please read it before you decide whether you want to use this script or not.

    will luo, 03/30/05, 3:00am