Fun thoughts

/Fun thoughts

Eclipse for Perl Developers

I've spent a lot of time in Perl using Eclipse lately. Personally, I prefer to use vim for most of my development. Lately though, I've been spending a lot of time in some complex java with many different references to many different classes; I'll be honest, I am enjoying the index and code-completion. I have [...]

By | June 11th, 2013|Application Quality, cloud development, Fun thoughts, Global Development, tip|Comments Off on Eclipse for Perl Developers

Big Data Implementations

Big Data Implementations. Big Data is conceptually a collection of data and the means to store (a file system) and access it (a runtime). This means that all of the components are swappable including the very core components that were the archetypal definition of Big Data – the file system (HDFS) and the execution environment [...]

By | April 16th, 2013|Big Data, BigSQL, Fun thoughts, Hadoop|Comments Off on Big Data Implementations

Hadoop

Hadoop - HDFS and MapReduce. Hadoop came out in 2008 and the primary components of it (HDFS and MapReduce) conceptually look like this. Hadoop, HDFS and MapReduce MapReduce splits work across many machines. Map takes the job, breaks it into many small reproducible pieces and sends them to different machines and runs them [...]

By | April 16th, 2013|Big Data, Fun thoughts, Hadoop|Comments Off on Hadoop

HDFS

HDFS, The Hadoop Distributed File System. When you Map a problem you break it down into key value pairs and spread it out across a number of processors – abstract it to its simplest components and give them to lots of workers. HDFS is a file system specifically designed to handle LOTS of key value [...]

By | April 16th, 2013|Big Data, Fun thoughts, Hadoop, High Availability, OpenSource|Comments Off on HDFS

MapReduce

MapReduce. MapReduce has a language and there is a syntax to this language (like English). Behind MapReduce are concepts that the language implements. You don't need to understand the language to appreciate the elegance of MapReduce, you need a good analogy. Here's mine: It is 1930. Alan Turing and Alonzo Church in England have just [...]

By | April 16th, 2013|Big Data, Fun thoughts, Hadoop, High Availability, OpenSource|Comments Off on MapReduce

Big Data

Imagine what Big Data can do for you. Big Data is a term, like database, that can mean different but similar things depending on context: Big Data is archetypically the Hadoop Distributed File System (HDFS) and MapReduce. The open source Apache project Hadoop includes HDFS and an implementation of MapReduce. Big Data is a collection [...]

By | April 15th, 2013|Big Data, BigSQL, Fun thoughts, Hadoop, High Availability, OpenSource|Comments Off on Big Data