Is Transaction Database good for reporting?

Hi Everybody,

Here is the situation for my reporting requirements. Please suggest me how to proceed. I just have one of two options but which one to go is the question i want to ask all of you. May be you can suggest me something new.

  1. Reports data needs to be almost live.
  2. Report requirements are all Detail reports (not summary report)
  3. Adhoc facility required for business users. Here also they expect live data(almost live)
  4. Assume the number of rows fetched will be approx 100 thousand (initially)

We have two options

  1. we will create a seperate database tables for reporting using DB2 Materialized query tables(MQTs). This needs frequent refresh of MQTs. I am not sure if this refresh might hit the front end application performance.
  2. Use transaction tables for creating universe and then reports. Will the application performance/report refresh time be affected if we use transaction tables?

Your replies are highly appreciated. Thanks


saravanainfo :india: (BOB member since 2006-12-13)

It depends on a bunch of factors.

If you are using a replication tool (like Shareplex from Quest), then it can work pretty well. You can create indices which can improve performance.

The issue isn’t with how much data you are retrieving. The issue is with how much data you are storing and how fast your queries run.

While it’s certainly not ideal, it can be done.


Steve Krandel :us: (BOB member since 2002-06-25)

I wouldn’t go against the transactional database, this is likely to affect both performance of the application and your reports.

Whenever I have had this sort of requirement, generally we have gone with replication, the time period driven by how “live” your data has to be.


Mak 1 :uk: (BOB member since 2005-01-06)

Hi

The well defined database setup will not cause you much on the performance front…

Regards

Siva.M


looksmee :uk: (BOB member since 2006-02-08)

Thanks for all your replies. I am taking all your inputs.


saravanainfo :india: (BOB member since 2006-12-13)

Here’s a thought for you to consider… why do you need 100 thousand rows to answer an adhoc question? :slight_smile:


Dave Rathbun :us: (BOB member since 2002-06-06)

:smiley: :smiley: :smiley:
No. of rows is the max no. of rows that will be available during the initial phase


saravanainfo :india: (BOB member since 2006-12-13)

So, if the whole transaction database is only 100,000 rows, performance should not be an big problem.


Steve Krandel :us: (BOB member since 2002-06-25)

Initially yes. But later on the number of records will increase drastically. Still it is recommended?


saravanainfo :india: (BOB member since 2006-12-13)

The comment was because of the above statement, taking a couple of precaution related to the query performance will not hit you much…

Regards

Siva.M


looksmee :uk: (BOB member since 2006-02-08)

Then, how many rows available in the transaction database (approx) will/may affect the performance of report/application. Because i came to know that the Initially the data available in Transaction database can be approx 10 million rows :frowning:
Sorry for this delayed update. Comments again please.


saravanainfo :india: (BOB member since 2006-12-13)

Think of it this way:

A transactional database is designed for efficiency of data capture.

A reporting (data mart/warehouse) is designed for efficiency of data retrieval.

There is a reason for both. Although reporting against transactional databases can be done, the complexity and size of this database really doesn’t make it an ideal database for reporting. Also, think of how the transaction system can be affected if a query locks a table during reporting. Now, I am sure they wouldn’t be too happy with that either.

Usually the reasons for not wanting a proper data mart or warehouse are the effort and cost involved in creating it. However, a poorly designed reporting database tends to be much more expensive in the long run.

Hope this helps,

Mitra


Mitra Moini (BOB member since 2002-09-01)

For exteremely large ‘Data Vaults’ - look at Teradata - or alter your thinking re storing ‘everything’ in a typical ODS/DWH scenario.

i.e. this topic makes me think of two shopping scenario’s - a bachelor walking down the aisles with a trolley grabbing what he thinks he might need, compared to a women with a basket and a list that knows exactly what she wants and how much each item should cost.

It all goes back to requirements and costs - anything is possible these days with the technology that’s available and the old debates around ODS/DWH/EDH/ETL/Star Schema etc are busy being shaken up by architectural changes that are revamping DWH and BI as we know it.

“Real-time is anything that is too fast for your current ETL” - Ralph Kimball.

Newcomers to DWH should be discussing CTF, SOA (yeah I know - last years buzzword), MBETL, SETL, EII and Step Dimensions etc.
http://www.docstoc.com/docs/479128/Considerations-for-Building-a-Real-time-Data-Warehouse

:yesnod:
:wink:


MikeD :south_africa: (BOB member since 2002-06-18)

Hi

The Teradata is among the best in RTDW. We had a discussion on implementing the RTDW and wind up with the conclusion “Its cost you more…”

http://www.teradata.com/t/page/148873/index.html

Regards

Siva.M


looksmee :uk: (BOB member since 2006-02-08)

Mike, Amen to that. An extendable model of the true business process’s is what we shoiuld all be aiming for… 8) .


Mak 1 :uk: (BOB member since 2005-01-06)

… that was Mitra …


MikeD :south_africa: (BOB member since 2002-06-18)

Oops sorry Mitra!!

Bit bleary eyed this morning… :oops:


Mak 1 :uk: (BOB member since 2005-01-06)

And what is wrong with that? Probably why most women manager their houshold budgets :wink:

On the serious side, I agree that the technology is bringing these concepts closer. However, it still boils down to real requirements (vs. wish lists) and cost. Afterall, a Lamboughini would be awesome to drive but if I can’t afford the car or the fuel it needs or won’t transport my soccer team to practice, then a minivan would be just fine as well.

Mitra


Mitra Moini (BOB member since 2002-09-01)

Uh, what kinda car is that? Is that the chinese rebuild of Lamborghini :wink:


Andreas :de: (BOB member since 2002-06-20)

Appreciate your comments. But here, we have limitations. We can’t get Teradata or anyother database. The only option is DB2. The budget is very low and they are not looking for anyother software/tools. Also, we tried convincing them to have seperate database which is refreshed every night. But they want live data for most of the reports and also for adhoc reporting.


saravanainfo :india: (BOB member since 2006-12-13)