Twitter Analytics
Twitter analytics uses a wide variety of techniques to analyse Twitter big data. Twitter offers functionality to download tweets so that analytics can be performed manually on this data. But the company...
RFID stands for radio-frequency identification and it enables the automatic identification and tracking of tags attached to objects. In many applications a very high number of those tags are used and thus it is...
A Support Vector Machine (SVM) is a classification technique developed around 1990 for data analysis. SVMs perform very well in many settings and are considered one of the best ‘out-of-the-box classifiers’....
DALY stands for Disability-Adjusted Life Years, a measure used with health datasets to quantify the burden of disease. Health data from patients, such as those who suffer from chronic diseases, contains insights...
The PANGAEA open data collection archives and publishes earth science datasets. It enables re-use of data and lets interested people upload their own datasets too. Some of its existing open datasets can be considered...
IEEE Big Data Conference 2016
In recent years, “Big Data” has become a new ubiquitous term. Big Data is transforming science, engineering, medicine, healthcare, finance, business, and ultimately our society itself. The IEEE Big...
A confidence interval is an estimated range for a specific parameter that tells us something about the overall data space from which our dataset is a sample. In statistics the overall data space is called...
Linear regression in R is quite straightforward, and there are excellent additional packages for tasks like visualizing the dataset. This contribution provides an example based on free data and represents a short tutorial on linear regression...
R provides a number of free datasets as part of the ‘Statistical Computing with R’ tool. This page lists the available datasets and the libraries or packages in which they can be...
Free datasets are typically hard to obtain, since the data either includes sensitive information or was very costly to create. This page provides an overview of available datasets in order to practice...