Last month I have been attending the RittmanMead BI Forum 2014. In the wrap-up I mentioned a presentation by Andrew Bond & Stewart Bryson. They had a very nice presentation about the Oracle Information Management Reference Architecture. This needed some further investigation from my part.
This blogpost is a first summary of the information I found online so far.
There is a complete reference library of IT Strategies from Oracle. Pay extra attention to the following docs:
- Oracle Reference Architecture Information Management
- Oracle Reference Architecture Business Analytics Foundation
- Oracle Reference Architecture Business Analytics Infrastructure
- Oracle Reference Architecture Service Orientation
- Oracle Reference Architecture Security
- Oracle Reference Architecture Engineered Systems
Next to that, Oracle has a few White Papers focussing on Information Management (Big Data & Analytics):
Oracle Information Management – Logical View
Let’s take a closer look, see the picture below. This picture has been copied from; the Oracle Information Management Reference Architecture. presentation I mentioned above. There are a few major components in the Reference Architecture
- Data Sources
- Information Provisioning
- Data Ingestion
- Logical Data Warehouse
- Information Delivery
While putting together this blog post, Mark Rittman posted the following article(s); “Introducing the Updated Oracle / Rittman Mead Information Management Reference Architecture – Pt1. Information Architecture and the “Data Factory” & Pt2. – Delivering the Data Factory” on the RM Blog.
Within this Reference Architecture Oracle should be able to handle all sorts of data:
- Traditional Enterprise Data (ERP, CRM, etc.)
- Machine-generated /Sensor Data (Smart Meters, Equipment Logs, etc.)
- Social Data (Twitter, Facebook, etc.)
The last couple of years / decades, Data has changed. We (as BI/DW-Consultants) have always looked at data in a Traditional (Structured) way. Now Oracle provides an Architecture to combine the old Data with the new (Big) Data. Whether there are 3, 4, 5, or whatever number of V’s, the most important thing is that you get value from your Data!
Data Ingestion (Loading)
Oracle provides / supports tools to perform Batch or (Near) Real-Time Data Ingestion.
Oracle GoldenGate & Oracle Data Integrator
Michael Rainey is writing an excellent series on how to load the Raw Data Reservoir (Staging Layer) and the Foundation Layer; Oracle GoldenGate and Oracle Data Integrator – A Perfect Match in 12c.
Cloudera Distribution including Apache Hadoop (CDH)
Hadoop is Open Source and Apache licensed. Cloudera provides an Open Source Distribution ( Cloudera CDH – Cloudera Enterprise Data Hub).
Oracle Event Processing (OEP)
OEP is a complete solution for building applications to filter, correlate and process events in real-time. Check the Data Sheet here.
Logical Data Warehouse
There are a few different Layers:
- Raw Data Reservoir (Staging Layer)
- Foundation Layer
- Access and Perfomance Layer
Depending on the requirements and the tooling one can decide to skip or combine the different Layers. Oracle has the tooling to load the various Layers directly (GoldenGate).
The data is no longer only stored in the Oracle Database. The ‘new’ types of data require ‘new’ types of storage. There are different Data Stores for different purposes.
- Historical (Historical Integrity)
- Analytical (Ease of Acces & Query Performance)
The Oracle Big Data Connectors can be used to integrate Apache Hadoop with Oracle Database Software.
Processing the data could be either In-Database (Oracle Database Options – Advanced Analytics, OLAP) or In-Memory (Oracle TimesTen).
Oracle is able to support proven answers to known questions via Oracle BI. Fast answers to new questions are provided by Oracle Endeca Information Discovery.
“The Oracle BI Foundation Suite provides comprehensive capabilities for business intelligence, including enterprise reporting, dashboards, ad-hoc analysis, multi-dimensional OLAP, scorecards, and predictive analytics on an integrated platform”
OBIEE makes it possible (Logical Layer) to skip the ‘Access and Perfomance Layer’ and source directly form the ‘Foundation Layer’. There are some very interesting presentations online around this subject.
“Oracle Endeca Information Discovery is a complete enterprise data discovery platform that combines information of any type, from any source, empowering business user independence in balance with IT governance. Now organizations can access the information they need, when they need it, to make business decisions they can trust.”
There is still a lot more to investigate, but for me this gives a little bit more guidance.