
What is Apache Sqoop?
Sqoop provides a mechanism to connect to external systems such as an EDW (Enterprise Data Warehouse, e.g. Amazon Redshift), Relational Database...

Installing Cloudera Manager
Installing Cloudera Manager 5.4.1 in VirtualBox/Linux/CentOS: a step-by-step guide to installing Cloudera Manager in VirtualBox for a clean...

What is MapReduce in Hadoop?
Explaining MapReduce with an example... MapReduce is a programming model and an associated implementation for processing and generating...

HDFS Architecture
HDFS has a master/slave architecture. An HDFS cluster consists of a single NameNode, a master server that manages the file system...

Hadoop Cluster
Overview of Hadoop Cluster: A Hadoop cluster is a special type of computational cluster designed specifically for storing and analyzing...

Understanding Context Variables in Talend
Talend has the concept of context variables; once created, they can be accessed anywhere in a job. These context variables...

Talend Web Service using REST API
The tRESTClient component sends HTTP and HTTPS requests to a Representational State Transfer (REST) Web service...

Talend - DI - Connect Components with Triggers or Rows
Row Connections: The actual data, in row form, can be passed to the next component using the connections below. Trigger: The row form of actual data can be...

Talend - DI - Java - Custom Code Components
Custom Component tJava: This component can be used as a trigger component, at the start or at the end of the job. tJavaRow: this component...