Author: www.big-data.tips

Cross Validation

Cross validation is a smart technique for performing model selection during the validation process. Model selection means deciding on a specific machine learning model (e.g. artificial neural network, decision trees, support vector...
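
As a rough illustration of the idea, the sketch below splits a dataset into k folds so that each fold serves once as validation data while the rest is used for training. The excerpt does not say which cross validation variant is meant; k-fold splitting, the function name `k_fold_indices`, and the values 10 and 5 are assumptions made for this example.

```python
def k_fold_indices(n_items, k):
    """Yield (train_indices, validation_indices) for each of k folds.

    Assumption: plain k-fold splitting without shuffling.
    """
    indices = list(range(n_items))
    # Spread any remainder over the first folds so sizes differ by at most 1.
    fold_sizes = [n_items // k + (1 if i < n_items % k else 0) for i in range(k)]
    start = 0
    for size in fold_sizes:
        validation = indices[start:start + size]
        train = indices[:start] + indices[start + size:]
        yield train, validation
        start += size


# Hypothetical example: 10 data items, 5 folds.
folds = list(k_fold_indices(10, 5))
for train, validation in folds:
    print("train:", train, "validation:", validation)
```

For model selection, each candidate model would be trained on the `train` indices and scored on the `validation` indices of every fold, and the candidate with the best average score would be kept.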

Sampling Methods

Sampling methods refer to techniques that pick a specifically chosen number of L samples out of N data items in a dataset for data analysis. More formally, ‘sampling methods’ select a...
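
A minimal sketch of picking L samples out of N data items, assuming simple random sampling without replacement (only one of several possible sampling methods). The helper name `draw_sample`, the seed, and the sizes N = 100 and L = 10 are illustrative choices, not taken from the article.

```python
import random


def draw_sample(items, num_samples, seed=None):
    """Pick num_samples (L) distinct items out of the given N items."""
    rng = random.Random(seed)  # a local generator keeps the example reproducible
    return rng.sample(list(items), num_samples)


dataset = list(range(100))                   # N = 100 data items
subset = draw_sample(dataset, 10, seed=42)   # L = 10 samples
print(subset)
```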

Kernel

A chosen kernel is one of several so-called kernel methods in machine learning that enable the smart use of non-linear decision boundaries. Such decision boundaries are often necessary since in many cases the...

Gradient Descent

Gradient descent refers to a technique in machine learning that finds a local minimum of a function. It is a fairly general optimization technique used in many application areas. It can be used to...
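
To make the idea concrete, here is a minimal sketch of gradient descent on a one-dimensional function. The test function f(x) = (x - 3)^2, the learning rate, and the step count are assumptions chosen for illustration; they do not come from the article.

```python
def gradient_descent(grad, x0, learning_rate=0.1, steps=100):
    """Repeatedly step against the gradient, starting from x0."""
    x = x0
    for _ in range(steps):
        x = x - learning_rate * grad(x)
    return x


# Hypothetical example: f(x) = (x - 3)^2 has gradient 2 * (x - 3)
# and a single minimum at x = 3.
minimum = gradient_descent(lambda x: 2 * (x - 3), x0=0.0)
print(minimum)
```

With a fixed learning rate the method only converges if the steps are small enough; for this function each step shrinks the distance to the minimum by a factor of 0.8.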

Neural Network

A neural network, more accurately referred to as an Artificial Neural Network (ANN), is a fairly complex data analysis technique. It is based on a well-defined architecture of many interconnected artificial neurons. But it also...

Sequential Minimal Optimization

The iterative algorithm Sequential Minimal Optimization (SMO) is used for solving quadratic programming (QP) problems. One example where QP problems are relevant is during the training process of support vector machines (SVM). The SMO...

ETL – Extract Transform Load

ETL stands for the whole process of extracting, transforming, and loading big data using database tools. Extract means to get data out of different data sources. Transform means that the data format is changed...

Social Media

Social media is a massive source of Big Data today. The data is generated through user interactions or simply by users letting others know what they are thinking about. The main information source...

BDVA – Big Data Value Association

The Big Data Value Association (BDVA) is a non-profit organisation with members from research and industry. It aims to boost European Big Data value research, development and innovation. That means it improves industrial competitiveness...