Data Warehouse Alternatives Make a Hit in India and Europe > > Intelligent Enterprise: Better Insight for Business Decisions

Welcome Guest. | Log In| Register | Membership Benefits

Intelligent Enterprise

Better Insight for Business Decisions

Intelligent Enterprise - Better Insight for Business Decisions
search Intelligent Enterprise
Home
Digital Library
Events
RSS | Newsletters
Webcasts


  • EMAIL
  • PRINT
  • REPRINTS
  • Follow Us on Twitter
  • FOLLOW US
  • Share

Data Warehouse Alternatives Make a Hit in India and Europe


Telco Reliance scales with DW appliances. Web marketer TradeDoubler loads and queries faster, cuts cost with a column-store database.


By Doug Henschen
June 20, 2008

The Need for Speed

Scalability was decidedly not the problem facing TradeDoubler. In fact, the Web marketing firm's warehouse was less than one terabyte, but complex analytic queries against as many as 3 billion rows of data demanded extensive aggregation. What's more, since the firm studies constantly changing clickstream data, the database had to be continually rebuilt, reindexed and tuned.

"You have to structure the database to be able to ask the questions, and that takes a lot of work," says CTO Ola Uden. "We had a one person working with the data full time, but depending on the complexity of the queries, it took anywhere from half a data to two days to get the data out."

Early this year TradeDoubler implemented the Brighthouse column-store database from InfoBright. Column-oriented databases are faster than conventional (row-oriented) databases in many analytic applications because they can query selected attributes without wading through all the non-relevant data in each row. Leading column-store databases are also designed to take advantage of commodity hardware supporting massively parallel processing. TradeDoubler is running Brighthouse on an inexpensive ($12,500) Dell server with two quad-core processors.

TradeDoubler optimizes Web marketing campaigns across Europe and Asia for more than 1,600 advertisers by analyzing Web clicks, impressions and purchases. Customers include online retailers such as Apple and Dell. Brighthouse and Pentaho BI software are serving as the engine behind TradeDoubler's TD Integral Cross-Media Marketing Platform, which is designed to "understand the complete customer journey" across search engines, affiliate sites in TradeDoubler's network and online advertising."

"If Apple is running a campaign for the iPhone, they want to look at how people ended up buying one at their site," explains Mats Johansson, a senior consultant at Lincube Group AB, which helped TradeDoubler with the Brighthouse implementation. "What did they do before they made that purchase? Did they read a review or were they responding to an ad? Which sites were they visiting and how did they arrive at the Apple store?"

TradeDouble has more than 125,000 Web sites in its network, and it tracks some 20 billion impressions, 265 million unique visitors and 12 million leads per month. The Brighthouse implementation went into production in May, and the firm now loads and rebuilds the database every day, retaining three days of network-wide clickstream data and 60 day's worth of online order information.

TradeDoubler continues to rely on Oracle for many of its transactional processing needs, but constant rebuilding and, in particular, high-volume loading necessitated an alternative approach, says Johansson. "Loading 2 billion rows a day while still maintaining performance on analytic queries would have been quite expensive," he says.

Between faster loading speeds, automated indexing, 30X data compression and faster query times, TradeDoubler is getting faster answers at a lower cost than would have been possible with conventional technology. With appliances, column-store databases and related software-hardware configurations growing in number and diversity (from small-scale to ultra-high-capacity), it looks like the days of building data warehouses from scratch are winding down all over the globe.


  • EMAIL
  • PRINT
  • REPRINTS
  • Follow Us on Twitter
  • FOLLOW US
  • Share


 





New on the BLOG
Compatibility and SaaS Multi-tenancy
09. 1.2010
blog author
Josh Greenbaum
Fellow Enterprise Irregular Phil Wainright has commented on my recent post regarding multi-tenancy with a well-considered rebuttal that I believe deserves its own rebuttal in turn. Here it is:

Read more from Josh Greenbaum >>

James Kobielus
Data Analytics: Key to Process Optimization
Rome was not reinvented in a day. Your enterprise business processes won't turn around overnight either. You'll need to re-engineer processes while you continue to run a business -- albeit one with many buried layers, some splendid ruins, and many construction projects that cause never-ending traffic snarls.

09. 1.2010
Read more from James Kobielus >>

Why HP and Dell are Going Nuts Over 3PAR
08.31.2010
blog author
Rajan Chandras
Of the many reasons why HP and Dell are wooing 3PAR, only one actually has to do with 3PAR... Beyond all the qualitative and quantitative factors, HP and Dell are betting that virtualization and cloud computing are the storage of the future.

Read more from Rajan Chandras >>



Intelligent Enterprise Newsletters
Subscribe Here:
*Email:
 First Name:
 Last Name:
  Intelligent Enterprise Blogosphere Newsletter:
  Intelligent Enterprise Newsletter:

Email Type:





Enabling People and Organizations to Harness the Transformative Power of Technology