Question : automating file operations, dynamic internet search and download

i need to do the following many times, so i really need to automate the following procedure (or at least parts of it) and my question is how to achieve that:

---

specify a folder and put the name of the first subfolder into a variable (e.g. FilmName = "The Shawshanks Redemption (1994)")

strip the variable from special characters, remove occurences of "the" and take say the first two words of what is remaining (e.g. FilmKeywords="Shawshank Redemption")

now enter that stripped film name into imdb search and make sure you get the film page (i.e. http://www.imdb.com/title/tt0111161/)

save that page as a local html only (not complete) file in a specified folder on my harddrive

now take the stripped variable name into search in http://www.subtitles.de/main.php?l=search and find if there is a german subtitle for it (i.e. http://www.subtitles.de/upload/german/Shawshank%20Redemption.zip )

download teh subtitle into a specified folder on harddrive and unzip it and rename that file to the stripped variable name

finally, find a trailer corresponding to the stripped varialbe name, one potential way to do that is to follow the trailers and videos link in the imdb film page (e.g. http://www.imdb.com/title/tt0111161/trailers) and take the first hit and use the firefox extension DownloadHelper (http://www.downloadhelper.net/install.php) to download the flash or quicktime file as a local trailer clip  (i can see that this is probably the hardest part to automate)

... and now repeat the process, taking the next subfolder (i.e. another film), downloading the imdb film page, the german subtitle and a trailer for it :)

---

my current general ideas how to potentially achieve that is to either use Windows Script Host or to use VB.NET . Either way could you point me to the main functions i would need to use in order to work with folders on my harddrive, search the internet , download from internet and interact with DownloadHelper or similar ?

Answer : automating file operations, dynamic internet search and download

You might get a better response if you include this in a .Net Zone.

I would also split it up. Your questions is big and covers several skill sets which might scare Experts away.
  • Reading sub folder names 
  • Removing Stop Words from strings 
  • Making web requests and saving the resultant page/file 
  • Scraping web responses to find the subtitles URI 
  • Unzipping files 
  • Downloading trailers - I don't think what you suggest is possible 
Several of these would have ready made answers in EE or via Google.
Random Solutions  
 
programming4us programming4us