Movie Ratings
Movie ratings refers to data sets whereby users rate the movies they see in order to give advice to other users. They are available as a free dataset from the MovieLens web site that...
Movie ratings refers to data sets whereby users rate the movies they see in order to give advice to other users. They are available as a free dataset from the MovieLens web site that...
Product recommendation is about predicting ratings of users and creating personalized recommendations for users for a wide variety of products like books, clothes, movies, videos, or songs. Today this is mostly done online when...
A recommendation engine is a tool that recommends products to consumers based on user preferences or being similar to other users when performing big data analysis. It is a tool that is the key...
Page rank is a machine learning technique that increases the effectiveness of search engines and improve their efficiency. It is used to measure the importance of a page and to prioritize pages returned from...
Machine learning applications can be found in a wide variety of application domains that take advantage of learning from big data. Applications in this area are given when the problem to be solved changed...
Speech recognition refers to the recognition of spoken speech in terms of converting the acoustic speech signal into an ASCII text. This acoustic speech signal can be considered as big data in many application...
Apache Spark vs Hadoop is an interesting comparison between two key technologies that are able to work with big data. One aspect for understanding a key difference of both relies in the fact that...
Spark RDD stands for ‘Resilient Distributed Dataset’ that are a key concept in the Apache Spark tool in order to work with big data. RDDs are a fault-tolerant collection of elements to operate on...
A Spark summit is a great opportunity to explore and discuss a wide range of topics that are related to big data processing with Apache Spark. Participants include technology experts highlighting different Spark features...
Spark machine learning algorithms are implemented in the machine learning library (MLlib) of Apache Spark that is able to handle Big Data. It is a scalable and parallel machine learning library with a number...