Hello all,
I have a screen scraping application (go to a lots of sites, extract 10k
stuff, integrate the results, put them to DB etc). Now i want to use a
Rails application as a frontend to this: The user can push a button
which triggers the screen scraping app and view the results (preferably
asynchronously, but that does not really matter right now).
Questions:
-
Should the screen scraping app reside in the Rails directory structure
or somewhere else (since it can be viewed as a standalone application-
Rails just displays data from a DB, does not necessarily need to know
who and how did generate the DB). OTOH, if it is easier to communicate
if the screen scraping dir is integrated into the Rails directory
structure, i don’t have problem to do that either. In this case, where
should i put the screen scraping code? (We are talking about lets say 50
classes altogether) -
How to trigger the screen scraping in the background - i.e. the user
clicks a ‘start’ button, gets back a ‘screen scraping started’ message
and can work with the web page further, instead of waiting on the result
(since that can a last even several hours in extreme cases)
Thanks,
Peter