Category: Big Data Tools

Install Hadoop Apache Map-Reduce Analytics Big Data Tips Mining Tools Analysis Analytics Algorithms Classification Clustering Regression Supervised Learning

Install Hadoop

Install Hadoop on a cluster enables the ability to process and analyse big data using the map-reduce paradigm. This article provides a step-wise installation guide. Step 1 – Create Hadoop User Before we generate...

Ubuntu Generate SSH Key Key-Gen Process Big Data Tips Mining Tools Analysis Analytics Algorithms Classification Clustering Regression Supervised Learning

Ubuntu Generate SSH Key

Ubuntu generate SSH key stands for a process required to setup the right security environment for several big data frameworks like Apache Hadoop or Apache Spark. This setup is required to perform selected operations...

Recommendation Engine Movie Rentals Google Big Data Tips Mining Tools Analysis Analytics Algorithms Classification Clustering Regression Supervised Learning

Recommendation Engine

A recommendation engine is a tool that recommends products to consumers based on user preferences or being similar to other users when performing big data analysis. It is a tool that is the key...

Apache Spark vs Hadoop Comparison Big Data Tips Mining Tools Analysis Analytics Algorithms Classification Clustering Regression Supervised Learning

Apache Spark vs Hadoop

Apache Spark vs Hadoop is an interesting comparison between two key technologies that are able to work with big data. One aspect for understanding a key difference of both relies in the fact that...

Spark Machine Learning Apache MLlib Big Data Tips Mining Tools Analysis Analytics Algorithms Classification Clustering Regression Supervised Learning Tool

Spark Machine Learning

Spark machine learning algorithms are implemented in the machine learning library (MLlib) of Apache Spark that is able to handle Big Data. It is a scalable and parallel machine learning library with a number...

SAAS Examples Software-As-A-Service CRM ERP Cloud Computing Big Data Tips Machine Learning Mining Tools Analysis Analytics Algorithms Clustering Tool

SAAS Examples

SAAS examples can be found in many application domains whereby SAAS stands for Software-As-A-Service. Examples of SAAS typically provide specialized application interfaces that enable users to focus on one particular domain of business or...

CUDA Programming Compute Unified Device Architecture NVIDEA GPU Big Data Tips Machine Learning Mining Tools Analysis Analytics Algorithms Clustering Tool

CUDA Programming

CUDA programming refers to the way of developing a program for the Compute Unified Device Architecture that is the computing engine in NVIDEA GPUs. CUDA has been used in many applications to accelerate computing...

SLURM Workload Manager Job Scheduler Jobscheduler Big Data Tips Machine Learning Mining Tools Analysis Analytics Algorithms Clustering Regression Tool

SLURM Workload Manager

SLURM stands for Simple Linux Utility for Resource Management and is a job scheduler tool used in high performance computing (HPC) environments in order to process big data. It is free and open-source and...