I'm a developer and I wrote a perl/bash script that connects to the official WoW armory and downloads XML files with details about each guild and players in each guild. The current process takes about 24 hours to run the guild checker script. This is how it works:
There is one flat file which has definitions of what to search for. I generated a list of guilds on the American servers:
Code: Select all
[ronnie@ronalddove01 advanced]$ cat hitlist | wc -l
24543
[ronnie@ronalddove01 advanced]$
Phase 1: The python script crawls through the armory looking for every guild and downloads the member list of each guild. It stores a database of players based on what class they play. This process takes 24 hours to download every player from the 24k guild list. For example this generated about 100k of night elf warriors of the level 80 of female status for the entire list.
Phase 2: Downloads every player from the new generated lists in to a directory. This can be used for parsing statistical details about reputation etc. This process takes 72 hours or more based on how many downloads are required.
These processes could be ran once a month for a better idea of statistical details.
New technology bot: Will download entire armory by linking players to players and roaming the entire WoW armory until it gets every players details.
The only problem I see with the bot is if blizzard bans it from downloading this much data.
Code: Select all
[ronnie@ronalddove01 archive]$ ls | wc -l
12258
[ronnie@ronalddove01 archive]$ du -h
2.3G .
[ronnie@ronalddove01 archive]$
Ronald Dove
http://www.dovestech.com