Author: www.big-data.tips

Parallel Processing Definition Parallel Computing Big Data Tips Machine Learning Mining Tools Analysis Analytics Algorithms Clustering Regression Tool

Parallel Processing Definition

Understanding the essence of the parallel processing Definition is very important due to the use of parallelization in handling big data challenges. An interesting book that includes the definitions here can be found in...

Parallel Processing Example Parallelization Computing Big Data Tips Machine Learning Mining Tools Analysis Analytics Algorithms Clustering Regression Tool

Parallel Processing Example

Understanding a parallel processing example is quite important in order to understand the key concepts in parallelization and domain decomposition. This is in turn important since big data problems are typically solved using technologies...

SLURM Workload Manager Job Scheduler Jobscheduler Big Data Tips Machine Learning Mining Tools Analysis Analytics Algorithms Clustering Regression Tool

SLURM Workload Manager

SLURM stands for Simple Linux Utility for Resource Management and is a job scheduler tool used in high performance computing (HPC) environments in order to process big data. It is free and open-source and...

SRUN Command Parameters SLURM Job Scheduler Big Data Tips Machine Learning Mining Tools Analysis Analytics Algorithms Clustering Regression Tool

SRUN Command Parameters SLURM

The SRUN command in SLURM enables the submission of a parallel high performance computing (HPC) job in a batch job script in order to analyze big data. It is usually often in job scripts...

Job Scheduler Scheduling Software Jobscheduler Big Data Tips Machine Learning Mining Tools Analysis Analytics Algorithms Clustering Regression Tool

Job Scheduler

A job Scheduler handles the multiple requests from different concurrent end users of a computing system that are analyzing big data in form of a so-called computing job. These schedulers are also sometimes called...

Amazon S3 Simple Storage Service Cloud Infrastructure Big Data Tips Machine Learning Mining Tools Analysis Analytics Algorithms Clustering Regression Tool

Amazon S3 Storage

Amazon S3 is a simple storage service used for retrieving and storing user data from and to the remote Amazon cloud infrastructure that is able to handle big data. It enables selected datasets considered...

GPU Graphics Processing Unit Computing Acceleration Big Data Tips Machine Learning Mining Tools Analysis Analytics Algorithms Clustering Regression Tool

GPU Graphics Processing Unit

GPU stands for graphics processing unit and is a relatively new mechanism used for parallel approaches in order to analyse big data. This is particular the case for data parallelism and task parallelism. In...

OpenStack Swift Object Storage Cloud Service Big Data Tips Machine Learning Mining Tools Analysis Analytics Algorithms Clustering Regression Tool

OpenStack Swift

OpenStack Swift is one of the core services of OpenStack that manages and control cloud resources. Swift is a very powerful service in order to manage and provide object storage and is thus a...

XEN Hypervisor Open Source Server Virtualization Big Data Tips Machine Learning Mining Tools Analysis Analytics Algorithms Clustering Regression Tool

XEN Hypervisor

The XEN hypervisor is an open source hypervisor used for virtual machine management. It thus enables a hardware level virtualization and is installed on top of the bare-metal hardware. Therefore this hypervisor technology implements...

Multi Core Processor Multi-Core Multicore CPU Big Data Tips Machine Learning Mining Tools Analysis Analytics Algorithms Clustering Regression Tool

Multi Core Processor

A multi core processor is a modern technology based on the improvements in processor and network Technologies. The significant advances in CPU chips over years contributed to multi-core architectures and is a key to...