All posts by Suresh Narayanappa

Pentaho and Big Data

Suresh Narayanappa

GrayMatter‘s Pentaho Big Data Services provides open source reporting, data mining, and analysis, workflow and dashboard capabilities. The company’s CEO, Quentin Gallivant, made a few predictions relating to Big Data for 2014. In my next blog sometime in May am going to share a unique use case for a Big Data implementation.

The “power curve” of Pentaho Big Data Services will be created by the demand among business users for blending of data. Some clients of Pentaho recently elaborated on their Pentaho projects in London and NY, and it is apparent that their companies are blending relational and big data. Business clients like them are increasingly inspired by the potential to gain newer insights in blended data from a comprehensive 360 degree customer point of view, which includes the capability to analyze customer patterns of behavior and then predict the possibility of customers taking advantage of offers targeted towards them.
Pentaho Big Data Services
Big Data will go mainstream

From a historical perspective, projects related to Big Data have decomposed in the IT department as considerable technical skills are required to deploy them. There are also a large number of mind boggling technologies that can be mixed to construct reference architectures. Clients are required to select among the number of open source and commercial technologies, which include NoSQL databases, analytics platforms, Hadoop distributions, high speed databases and a huge number of plug-ins and other tools. Existing infrastructure should also be put into the equation, like data warehouses and relational data and the way they will complete the picture.

More innovation and interactivity

More innovations will come from the Big Data open source community. Hadoop 2.0 and other new projects related to open source and YARN, the latest generation resource manager of Hadoop, will make the infrastructure of Hadoop more interactive. STORM, a protocol for streaming communications and another open source project, will facilitate more on-demand mixing and real time information blending inside the Big Data ecosystem.

More upgrades for better analysis

The future of analytics will be characterized by new functionality, plug-ins and upgrades. This will enable faster moving, blending and analyzing relational capabilities. The ability of the adaptive data layer will be improved and secured to make it easier for clients to manage their data flows. In a sentence, technology cannot stand still to make better analysis

In the field of analytics, the simplification of discovery of data will remain unabated, thus making it easier to locate patterns and anomalies. New technologies like machine data, predictive data and also real time analytics will be ramped up into the mainstream production.

Tags :

Pentaho Launches Community Edition 5.0

Suresh Narayanappa

Pentaho Corporation recently announced the immediate availability of its Open Source Pentaho Community Edition 5.0 (Pentaho CDE). It is the latest version of business analytics and open source data integration platform. The launch event also included the Pentaho Marketplace, where members of the community can download and explore all available plug ins developed by the Pentaho Community. It has extended the capabilities of the open source platform and permits community members to work together, share feedback and submit or create new plug ins to broaden Pentaho functionality.

According to the company, the new edition offers an economical entry point to people or companies during their first brush with business analytics when they want to visualize and act upon data. It is also an excellent tool-set for practiced developers, users and consultants who prefer an open code base to extend the borders of Pentaho and also business analytics.
Pentaho Community Edition
New features in Pentaho Community Edition 5.0

Pentaho, in collaboration with its community of developers, has made a powerful tools suite that offers an open source option for data analysts and developers to meet their goals. The latest edition includes:

    • Business Analytics Platform: The modern, interactive and simplified approach of Pentaho helps business users to discover, access and blend all sizes and types of data. Users can take advantage of a wide range of advanced analytics, that range from simple reports to predictive modeling, and can analyze and also visualize data through multiple dimensions, while at the same time with minimum IT dependence.


    • Data Integration: Pentaho Data Integration, better known as Pentaho Kettle, delivers powerful transformation, loading and extraction capabilities. This is a stand-alone application and is utilized to visually design jobs, and aid easier reporting and analysis.


    • Report Designer: Pentaho Report Designer is a graphic design tool which has the capability to generate reports from the data streamed through Pentaho Data Integration engine with no requirement for any kind of intermediate staging tables. Output reports can be in PDF, HTML, XML, CSV, Excel and rich-text file.


  • Auxiliary Tools: Users can download various types of auxiliary tools, like Pentaho Aggregation Designer for a simple interface to first create and then deploy aggregate tables that improve the performance of the Pentaho OLAP Cubes. Mondrian Schema Workbench is the open source designer for the visual creating and testing of the Mondrian OLAP cube schemas. The Pentaho Metadata Editor offers a simplified tool that you can use to create reports, build domains of Pentaho Metadata or the relational data models.

Tags :