Category: Big Data Tools

SLURM Workload Manager Job Scheduler Jobscheduler Big Data Tips Machine Learning Mining Tools Analysis Analytics Algorithms Clustering Regression Tool

SLURM Workload Manager

SLURM stands for Simple Linux Utility for Resource Management and is a job scheduler tool used in high performance computing (HPC) environments in order to process big data. It is free and open-source and...

SRUN Command Parameters SLURM Job Scheduler Big Data Tips Machine Learning Mining Tools Analysis Analytics Algorithms Clustering Regression Tool

SRUN Command Parameters SLURM

The SRUN command in SLURM enables the submission of a parallel high performance computing (HPC) job in a batch job script in order to analyze big data. It is usually often in job scripts...

Amazon S3 Simple Storage Service Cloud Infrastructure Big Data Tips Machine Learning Mining Tools Analysis Analytics Algorithms Clustering Regression Tool

Amazon S3 Storage

Amazon S3 is a simple storage service used for retrieving and storing user data from and to the remote Amazon cloud infrastructure that is able to handle big data. It enables selected datasets considered...

OpenStack Swift Object Storage Cloud Service Big Data Tips Machine Learning Mining Tools Analysis Analytics Algorithms Clustering Regression Tool

OpenStack Swift

OpenStack Swift is one of the core services of OpenStack that manages and control cloud resources. Swift is a very powerful service in order to manage and provide object storage and is thus a...

XEN Hypervisor Open Source Server Virtualization Big Data Tips Machine Learning Mining Tools Analysis Analytics Algorithms Clustering Regression Tool

XEN Hypervisor

The XEN hypervisor is an open source hypervisor used for virtual machine management. It thus enables a hardware level virtualization and is installed on top of the bare-metal hardware. Therefore this hypervisor technology implements...

BOINC Distributed Computing Volunteer Computing Big Data Tips Machine Learning Mining Tools Analysis Analytics Algorithms Clustering Regression Tool

BOINC Middleware

BOINC stands for Berkeley Open Infrastructure for Network Computing that essentially is a distributed computing tool. It provides middleware functionality and thus is able to work with big data in large distributed systems. This...

Amazon EC2 Elastic Compute Cloud Infrastructure Big Data Tips Machine Learning Mining Tools Analysis Analytics Algorithms Clustering Regression Tool

Amazon EC2

Amazon EC2 provides an elastic compute cloud (EC2) power in the Amazon cloud infrastructure that is able to handle big data. It enables customers to create virtual machines and provides functionality to manage end...

Amazon Web Services AWS Cloud IAAS Big Data Tips Machine Learning Mining Tools Analysis Analytics Algorithms Clustering Regression Tool

Amazon Web Services

Amazon Web Services (AWS) is a commercial cloud computing offering that follows the infrastructure as a service (IAAS) model. It is based on virtual machines that are used to flexible share computing and storage...

Google App Engine Cloud Platform PAAS Big Data Tips Machine Learning Mining Tools Analysis Analytics Algorithms Clustering Regression Tool

Google App Engine

Google App Engine is a cloud platform that enables a wide variety of cloud and Web applications in the context of big data. This offering from Google is a Platform as a Service (PAAS)...