cadym@openscg.com


About Cady Motyka

So far Cady Motyka has created 10 blog entries.

Oozie- Hadoop Workflow Scheduler in BigSQL

This tutorial shows you how to create an Oozie workflow in the latest version of BigSQL, using either BigSQL Hue or the Oozie command-line tools. Oozie is a workflow scheduler that lets you manage Hadoop jobs. You can also create an Oozie workflow that incorporates shell scripts, map-reduce applications, and even Pig, [...]

By Cady Motyka | January 29th, 2014 | Categories: BigSQL, Hadoop

Sqoop2 in BigSQL Hue

This tutorial shows you how to import PostgreSQL data into the Hadoop Distributed File System (HDFS) using the Sqoop 2 application within Hue. Hue, or the Hadoop User Experience, lets you use Oozie, Hive, Pig, Postgres, Sqoop, HBase, ZooKeeper, and Hadoop from a browser. Since release three of the BigSQL Quick-start Virtual Machine comes with Hue [...]

By Cady Motyka | January 8th, 2014 | Categories: BigSQL, Hadoop, PostgreSQL

Sqoop 2, Importing from PostgreSQL to HDFS

This tutorial will show how to configure Sqoop 2, start the server and client, and import a table from Postgres to HDFS. The latest version of Sqoop is part of BigSQL release two, which you can download here. The first part of this tutorial involves creating the customer history Postgres table using BenchmarkSQL: $ . ./setENV.sh $ [...]

By Cady Motyka | December 26th, 2013 | Categories: BigSQL, Hadoop, PostgreSQL

Hive, Postgres, and Hadoop Foreign Data Wrapper Video Tutorial

This demo shows how to run an example in BigSQL that uses Hive, Hadoop, PostgreSQL, and the Hadoop Foreign Data Wrapper to leverage the power of Hadoop from within PostgreSQL. This tutorial uses BigSQL 9.3 Release 2 - Beta2, which includes Hadoop-2.2 and Hive-0.12. You can download the newest release, or the Quick start VM [...]

By Cady Motyka | November 25th, 2013 | Categories: BigSQL, Cady PostgreSQL, FDW (Foreign Data Wrapper), Hadoop, PostgreSQL

Installing and Running BigSQL, the Hadoop and Postgres Bundle

This tutorial shows you how to install and run BigSQL. You can download the bundle from BigSQL.org and get started using Hadoop, Hive, HBase, and PostgreSQL in less than ten minutes. The video below goes over how to: complete the prerequisites (the commands used can be found here); start the bundle; open the psql, hive, [...]

By Cady Motyka | October 16th, 2013 | Categories: BigSQL, OpenSCG, OpenSource

PostgreSQL, Hadoop, and Pentaho to Analyze and Visualize Data

This tutorial shows how to use Postgres, Hadoop, Hive, HBase, and Pig to load, refine, and store big data for visualization, all in fewer than five minutes. You can watch the BigSQL, Postgres + Hadoop, Pentaho Demo here or follow along with the written version below. Video: http://www.youtube.com/watch?v=eSEe_33pImA The following tutorial shows the scripts [...]

By Cady Motyka | September 30th, 2013 | Categories: Big Data, BigSQL, Cady PostgreSQL, FDW (Foreign Data Wrapper), Hadoop, PostgreSQL

Mondrian, Hadoop & Postgres Tutorial

This tutorial explains how to install Mondrian, an Online Analytical Processing (OLAP) server, with Tomcat and Hadoop in BigSQL, so that users can analyze large quantities of data in real time. 1. First, you will need to download the following: a) Download Mondrian 3.5.0 Developers Version from http://sourceforge.net/projects/mondrian/files/latest/download b) Extract the downloaded [...]

By Cady Motyka | September 11th, 2013 | Categories: BigSQL, Hadoop

kCGErrorFailure with BigSQL

So you are trying to start BigSQL on your Mac, and after running the command "hadoop namenode -format" you get the following error: "java[5413] <Error>: kCGErrorFailure: Set a breakpoint @ CGErrorBreakpoint() to catch errors as they are logged." The issue may have to do with the combination of your OS and Java version [...]

By Cady Motyka | September 6th, 2013 | Categories: BigSQL

Using Hadoop To Flume Twitter Data

Flume is a great tool from Apache that can move large amounts of data from multiple sources to a single data store such as Hadoop. In BigSQL, we use Flume to move the log4j files from the benchmark program to HDFS. This process also uses the Hive SerDe (serializer/deserializer) properties, as shown in the [...]

By Cady Motyka | September 4th, 2013 | Categories: BigSQL, Hadoop

Hadoop Thrift Tutorial

At OpenSCG we have been using Thrift to build a Hadoop Foreign Data Wrapper. This tool, which is already integrated into BigSQL, lets you take advantage of the power of Hadoop from within PostgreSQL. The BigSQL Tutorial shows how you can easily create a Postgres table that references a Hive or HBase table, run [...]

By Cady Motyka | August 21st, 2013 | Categories: BigSQL, Cady PostgreSQL, FDW (Foreign Data Wrapper), Hadoop, PostgreSQL