This tutorial covers setting up BIRT (Business Intelligence and Reporting Tools) to run against Hadoop. BIRT is a relatively simple yet powerful reporting and analytics tool, and this tutorial shows how to use it with data stored in Hadoop.

1. Prerequisites

BigSQL Tutorial

Follow the BigSQL Tutorial (http://www.bigsql.org/se/tutorial.jsp) through step 3:

Step 1: Creating example.customer_history in Hive and loading values.
Step 2: Creating example.customer_history as a foreign table in Postgres.
Step 3: Verifying example.customer_history.

Eclipse with BIRT

a) Download Eclipse with the BIRT framework for your OS from http://www.eclipse.org/downloads/packages/eclipse-ide-java-and-report-developers/keplerr

  • Select your OS version from the Link column

  • Select a nearby mirror for the download location

b) Extract the downloaded .tar file and run Eclipse, setting the workspace location when prompted

2. Creating a Report in Eclipse

a) Start an instance of Eclipse with BIRT reporting configured.

b) In the menu, click Window → Open Perspective → Report Design

c) From the menu, click File → New → Report

  • Enter the parent folder
  • Enter the file name
  • Click the Finish button

3. Data Source configuration

a) Verify that BigSQL is running:

$ ./bigsql status

#################################################
# BIGSQL:- your BIGSQL version
# TIME: current time
# HOST: your hostname
# JAVA: path to your JAVA_HOME
# OS: your OS
#################################################
## Checking critical ports ################
Postgres port 5432 is busy.
ZooKeeper port 2181 is busy.
HDFS port 50070 is busy.
HBase port 60010 is busy.
Hive port 9083 is busy.
Tomcat port 8080 is busy.
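
In this output, a port reported as busy appears to mean that a process is already listening on it, which is the expected state when the service is up. The same check can be approximated without the bigsql script by trying to open a TCP connection to each port. The following is a minimal Java sketch, assuming the services run on localhost and use the ports listed in the status output; the class name PortCheck and the helper isListening are illustrative and not part of BigSQL.

```java
import java.io.IOException;
import java.net.InetSocketAddress;
import java.net.Socket;
import java.util.LinkedHashMap;
import java.util.Map;

public class PortCheck {

    // Returns true if something accepts TCP connections on host:port within timeoutMs.
    public static boolean isListening(String host, int port, int timeoutMs) {
        try (Socket socket = new Socket()) {
            socket.connect(new InetSocketAddress(host, port), timeoutMs);
            return true;
        } catch (IOException e) {
            // Connection refused or timed out: treat the port as not listening.
            return false;
        }
    }

    public static void main(String[] args) {
        // Ports copied from the status output; "localhost" is an assumption.
        Map<String, Integer> services = new LinkedHashMap<>();
        services.put("Postgres", 5432);
        services.put("ZooKeeper", 2181);
        services.put("HDFS", 50070);
        services.put("HBase", 60010);
        services.put("Hive", 9083);
        services.put("Tomcat", 8080);

        for (Map.Entry<String, Integer> entry : services.entrySet()) {
            boolean up = isListening("localhost", entry.getValue(), 1000);
            System.out.printf("%-10s port %5d : %s%n",
                    entry.getKey(), entry.getValue(), up ? "listening" : "not reachable");
        }
    }
}
```

For the data source configuration that follows, only the Postgres port (5432) has to be reachable from the machine running Eclipse.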

b) Open Window → Show View → Data Explorer

c) Download the PostgreSQL JDBC driver from http://jdbc.postgresql.org/download/postgresql-9.2-1003.jdbc4.jar

d) Right-click the Data Sources folder and select ‘New Data Source’

e) Select ‘JDBC Data Source’ and click the Next button.

f) Click Manage Drivers and add the JDBC driver downloaded in (c).

g) Select or enter the following values:

  • Driver Class : org.postgresql.Driver
  • Database URL : jdbc:postgresql:postgres
  • User Name : postgres
  • Password : password

h) Click the “Test Connection” button to verify the connection
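
The connection can also be checked outside Eclipse. The sketch below is a minimal example under several assumptions: the PostgreSQL JDBC driver jar from step (c) is on the classpath, the BigSQL Postgres instance listens on localhost:5432 (the port shown in the status output), and the credentials are the ones entered in step (g). The class name JdbcCheck and the helper buildUrl are hypothetical. The short URL jdbc:postgresql:postgres relies on the driver's defaults for host and port; the helper spells them out.

```java
import java.sql.Connection;
import java.sql.DatabaseMetaData;
import java.sql.DriverManager;
import java.sql.SQLException;

public class JdbcCheck {

    // Builds the fully qualified form of a PostgreSQL JDBC URL.
    public static String buildUrl(String host, int port, String database) {
        return "jdbc:postgresql://" + host + ":" + port + "/" + database;
    }

    public static void main(String[] args) {
        // Host and port are assumptions; user and password come from step (g).
        String url = buildUrl("localhost", 5432, "postgres");
        try (Connection conn = DriverManager.getConnection(url, "postgres", "password")) {
            DatabaseMetaData meta = conn.getMetaData();
            System.out.println("Connected to " + meta.getDatabaseProductName()
                    + " " + meta.getDatabaseProductVersion());
        } catch (SQLException e) {
            // Typical causes: driver jar missing from the classpath, server down, bad credentials.
            System.err.println("Connection failed: " + e.getMessage());
        }
    }
}
```

On Linux or macOS this could be run with something like `java -cp .:postgresql-9.2-1003.jdbc4.jar JdbcCheck` after compiling. If it fails here, the “Test Connection” button in BIRT will most likely fail for the same reason.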

4. Dataset and Report

a) Right-click ‘Data Sets’ and select ‘New Data Set’

b) Select the JDBC data source, enter a name for the data set, and click Next.

c) Enter the following in the “Query Text” window:

select hist_id, h_date, h_data from example.customer_history;

d) Drag the new data set onto the blank report to create the report table.
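
Before building the report, it can help to confirm that the query from step (c) returns rows. The following Java sketch runs it over JDBC and prints the first few rows. It assumes the same connection settings as in section 3, with the PostgreSQL JDBC driver jar on the classpath; the class name QueryPreview and the helper joinColumns are illustrative only. Columns are read as strings because the tutorial does not state their types.

```java
import java.sql.Connection;
import java.sql.DriverManager;
import java.sql.ResultSet;
import java.sql.SQLException;
import java.sql.Statement;

public class QueryPreview {

    // Same query as in step (c), without the trailing semicolon.
    public static final String QUERY =
            "select hist_id, h_date, h_data from example.customer_history";

    // Joins column values into one printable line.
    public static String joinColumns(String... values) {
        return String.join(" | ", values);
    }

    public static void main(String[] args) {
        // URL, user, and password mirror the data source settings; adjust as needed.
        String url = "jdbc:postgresql:postgres";
        try (Connection conn = DriverManager.getConnection(url, "postgres", "password");
             Statement stmt = conn.createStatement()) {
            stmt.setMaxRows(10); // preview only the first 10 rows
            try (ResultSet rs = stmt.executeQuery(QUERY)) {
                while (rs.next()) {
                    System.out.println(joinColumns(
                            rs.getString("hist_id"),
                            rs.getString("h_date"),
                            rs.getString("h_data")));
                }
            }
        } catch (SQLException e) {
            System.err.println("Query failed: " + e.getMessage());
        }
    }
}
```

If this prints no rows, the report table in BIRT will be empty as well, and the BigSQL Tutorial steps from section 1 are worth rechecking.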

5. Report View

In the top menu, click Run → View Report → In Web Viewer