3FG wrote:... includes both long and short forms of the links, nulls removed, web page descriptions added. Downloading takes about twice as long, because the description is on a different page than the actual file, so there are double the number of pages to download.
We don't really need the long description, we need the short description because it's how we connect the file that we're looking at back to the one listed on the web.
You shouldn't need to go to any extra pages to get this. I assume you start by going to the main index page for the category (eg,
here for cameras). This page lists the short descriptions (ie, the ones we need) along with various other bits of info, such as when it was last updated and the icon, and the URL. My suggestion would be to use this info to start the table of data. Then you can start downloading the files to get the files and the real file names.
Here's a list of the data that I'd like to see collected:
1) Category (eg, Cameras)
2) Short description (eg, Aiptek Action HD GVS HD camcorder)
3) File id (eg, 6119, don't need the full URL)
4) Icon GIF (eg, rm, km-xls, xls, zip, txt, etc)
5) Date (eg, 16 Jan 2009 03:56 pm)
6) Real file name (eg, Aiptek Action HD GVS.rmdu)
7) File type (eg, rmdu, zip, txt, etc)
3FG wrote:Still doesn't handle files with semicolon separators, combo protocols, or zip files.
Maybe you could add an extra column where you list why you didn't process the file, as this will help us go back and clean them up.
Also, could you start thinking about what it would take to have the tool run totally automatically. I'm thinking that it would be great if we could just click a button and it would download all of the files from all of the folders in the Device Upgrades section (apart from the WAV folder) and then automatically process them, with the results ending up in one big csv master file.
I'm thinking that I'll use the results that I have now to go through and un-zip files or update old KM files with the semi-colons, etc and then when I'm done, I'd like to be able to just re-run without needing to do all the "hand holding" that the app currently requires.
Btw, I'm already thinking ahead to other ways that we can use this app. I think there are still several files over in the old Yahoo groups that have not been ported, so there might be a use for an adapted version of the app even after we're done with this particular project. Hey, maybe it could also download all of the CCF files from the Pronto section at Remote Central with one button click! And maybe it could automatically run John's DecodeCCF against them. See what you've started!
