The basic process would be:
- Get a basic semantic picture of the domain (i.e. what items are of interest? What attributes distinguish them? Who is active in the area? Who blogs about it?)
 - Crawl the Web by means of link following, search term identification, and so on.
 - Find the valuable data sources and determine the database schema that best encodes them.
 - Scrape data on a periodic basis into your database, and make sure you have keyword-rich databases available.
 - Provide some kind of commenting/forum functionality.
 
There are, of course, lots of this kind of thing out there - they all suck.  A small community of devoted fans can make or break this; you can become the definitive guide to a very small domain.  Monetize with ads and with sales.
No comments:
Post a Comment