Hadoop Configuration
Hadoop configuration is an important process in order to setup the Apache Hadoop software to handle big data using the map-reduce paradigm. This article provides a step-wise configuration guide based on Hadoop version 2.8.0...
Hadoop configuration is an important process in order to setup the Apache Hadoop software to handle big data using the map-reduce paradigm. This article provides a step-wise configuration guide based on Hadoop version 2.8.0...
This ‘Install Oracle Java Ubuntu’ article informs about the installation process of Java 8 on Ubuntu in order to be used with big data frameworks such as Apache Hadoop or Apache Spark. It provides...
Install Hadoop on a cluster enables the ability to process and analyse big data using the map-reduce paradigm. This article provides a step-wise installation guide. Step 1 – Create Hadoop User Before we generate...
Ubuntu generate SSH key stands for a process required to setup the right security environment for several big data frameworks like Apache Hadoop or Apache Spark. This setup is required to perform selected operations...
A recommendation engine is a tool that recommends products to consumers based on user preferences or being similar to other users when performing big data analysis. It is a tool that is the key...
Apache Spark vs Hadoop is an interesting comparison between two key technologies that are able to work with big data. One aspect for understanding a key difference of both relies in the fact that...
Spark RDD stands for ‘Resilient Distributed Dataset’ that are a key concept in the Apache Spark tool in order to work with big data. RDDs are a fault-tolerant collection of elements to operate on...
Spark machine learning algorithms are implemented in the machine learning library (MLlib) of Apache Spark that is able to handle Big Data. It is a scalable and parallel machine learning library with a number...
SAAS examples can be found in many application domains whereby SAAS stands for Software-As-A-Service. Examples of SAAS typically provide specialized application interfaces that enable users to focus on one particular domain of business or...
CUDA programming refers to the way of developing a program for the Compute Unified Device Architecture that is the computing engine in NVIDEA GPUs. CUDA has been used in many applications to accelerate computing...