A data infrastructure is a key element to store, preserve, curate, and share big data in order to make it available for data analytics and analysis tasks. It is a digital infrastructure that offers data as digital objects in order to promote the data access and re-use in processing and analysis tasks. In many cases such infrastructures offers even more additional services like sensitive data management or federated identity management. Facebook or Instagram are just two well known industry examples that require a strong and powerful infrastructure behind their applications. Recently, European compute and data centers have started to realize a European Open Science Cloud that will be a powerful Cloud based data infrastructure in the next decades. More information about this cloud approach can be obtained here.
In science and engineering a well known US data infrastructure is DataOne that can be accessed here and is specifically focused on Earth science datasets. It provides access to such datasets across multiple member data repositories and offers enhanced search and discovery of Earth and environmental datasets. Another European collaborative data infrastructure is called EUDAT that can be accessed here and is rather multi-disciplinary offering a wide variety of services for researchers. In order to increase trust for its usage over twenty major European research organisations, including major European data and computing centres, have signed an agreement to sustain it for the next ten years. The third example in this field is called OpenAIRE that can be accessed here and is more optimized towards the finding of scientific publications and their associated datasets.
Data Infrastructure Details
Please take into account the following video: