SAP has just announced SAP HANA Vora. I’m sure you’re wondering what it is and what it can do for your business. So let me tell you.
But before I get into SAP Vora, I’ll give you a bit of context. So let me take a take a step back and mention the different types of data and data temperatures. For the purposes of this post I’ll cover hot and cold data. At a high level, hot data is data which is accessed frequently within the business such as recent transactional data, perhaps coming from an ERP system. Cold data is data which is accessed much less frequently and mostly used to provide context when doing an in-depth analysis such as historical sales data for the last X years.
One of the challenges we all face is combining these two types of data in a meaningful and coherent way without a huge amount of investment from a monetary and time and infrastructure perspective. This is the issue that SAP HANA Vora addresses.
So what is SAP HANA Vora?
SAP HANA Vora is an In-Memory, massively distributed data processing engine for Hadoop. It facilitates the integration of SAP data with distributed data lakes and brings together all types of data (hot, cold, warm, etc.) through one access point.
How is it achieved?
It is achieved through a performance booster add-on for Apache Spark which facilitates HANA connectivity to Hadoop clusters without any data replication. It also supports interactive SQL on Hadoop data and Enterprise Analytics including:
- Hierarchy support
- Units of Measurement (UOM) support
- On the fly currency conversion support
SAP HANA Vora allows us to take real-time hot data from something like SAP HANA and combine it with not-so-hot data in Hadoop clusters through a single entry point.
Crucially this allows data scientists to gain amazing insight through the combining of all these types of data. Imagine being able to query your real-time data and complete complex analytics influenced by massive amounts of historical cold data all through one single entry point.
SAP HANA Vora is a plugin for Apache Spark. It sits on top of Hadoop clusters facilitating a wide range of new features:
- OLAP on Hadoop, providing the ability to do hierarchies, drilldowns and conversions
- Compiled Queries improving the performance of queries in Apache Spark/Hadoop
- Parallel Access giving increased performance between SAP HANA and Apache Spark
- Rule-based DLM allowing Apache Spark/Hadoop systems to participate in dynamic data tiering
- Unified Admin via the SAP HANA Cockpit
- Open Programming language support such as Scala, Java, Python or R.
So there you have it. With SAP HANA Vora you can combine hot and cold data stores together to achieve a new level of insight without a massive investment from the business.