Add Value To Beat 'Googlization' - Intelligent Enterprise: Better Insight for Business Decisions

Add Value To Beat 'Googlization'


Publishers are adding deeper information to their services.


By Doug Henschen
February 1, 2005

Google keeps raising search expectations, announcing in December, for example, that it will scan and full-text index books from the libraries of Harvard, Stanford, Oxford, the University of Michigan and the New York Public Library. Publishers of proprietary information, meanwhile, decry what Amanda Spiteri calls the "Googlization" of research. "There's a growing mentality, particularly among the younger generation, that if it doesn't come up in an Internet search engine then it doesn't exist," says Spiteri, marketing director of Elsevier's ScienceDirect, a subscriber-based service for libraries, universities and research institutes.

To counter this attitude, Elsevier and other publishers are adding deeper information to their services. ScienceDirect has evolved from a Web database of Elsevier scientific journals into a service that now links to 170 other publishers and a total of 1,800 scientific, technical and medical journals.

In 2001, Elsevier launched Scirus.com, a free search engine that indexes 167 million science-related Web pages, including millions of .edu, .org and .gov sites as well as Elsevier and other proprietary sources. And in the past year, Scirus has focused on indexing more top-quality science sources, such as the journals of the American Institute of Physics.

"It's often difficult for users of public search engines to distinguish good information from bad," says Susan Vugts, marketing manager for Scirus. "Our crawlers look specifically at scientific sources, and we developed our own scientific taxonomy that improves the accuracy of indexing and search results."

At Thomson Financial, a supplier of real-time corporate data to the financial services industry, Internet competition from sources such as the SEC's Edgar database has led the company to add more detail. Thomson republishes all Edgar data, including XML, HTML and JPG attachments, as soon as it's available, but it also converts that data into ASCII and adds refined indexing.

"Edgar content has some tags, but they're not consistent, and they don't give our customers what they want from income statements and other documents," says Mary Ann Wismer, Thomson Financial director of document management. "Indexing is critical in terms of timeliness and making sure that facts are findable, so we add 15 to 20 index values to every document type."

