Exploring Big Data and Data Science

Course provider: STFC Hartree Centre
Target profile: Researchers in big data analytics, data science or related fields
Date:
04 May 2017
Registration deadline:
03 May 2017
Place: Daresbury Laboratory, Cheshire (
United Kingdom
)
Course level: Other
Keywords: Big Data, Data Science, data analytics

Course description:

Join us at the STFC Hartree Centre on the 4 May 2017 to explore some of the techniques, tools and environments used to tackle data science challenges at some of the world's most renowned international research facilities. This workshop is aimed at researchers, academics and technical staff working in (or with a keen interest in) big data analytics, data science or closely related fields.​

The workshop will begin with two talks from  Professor Geoffrey Fox from Indiana University on subject of the Apache Big Data Stack and the concept of Big Data Ogres. The Ogres are a way of analyzing the ecosystem of two prominent paradigms for data-intensive applications – for both high performance computing (HPC) and the Apache-Hadoop paradigm. They provide a means of understanding and characterizing the most common application workloads found across the two paradigms. HPC-ABDS, the HPC enhanced Apache Big Data Stack (ABDS) uses the major open source big data software environment but develops the principles allowing use of HPC software and hardware to achieve good performance.​

​Also included in the programme ​are talks from three different industry persepectives: Kenji Takeda from Micro​soft Research will discuss cloud services and big data, while Andy Grant from Atos will follow with discussion on big data applications in hybrid cloud environments. Jason Crain will discuss IBM Watson Technologies and Cognitive Computing. 

In the final session, Arjun Shanker from Oak Ridge National Laboratory (ORNL) will talk about their lab's innovative Compute And Data Environment for Science (CADES.) This system connects experimental facilities, HPC systems, data scientists and researchers at Oak Ridge National Laboratory. The CADES system aims to provide an integrated compute infrastructure delivering data science solutions and workflows for present and future research programs. Roger Downing from the Hartree Centre will follow this with discussion around how big data and cognitive technologies are being applied to UK industry in the Hartree Centre's collaborative projects. ​

Application procedure:

Full details and registration procedure: https://www.hartree.stfc.ac.uk/Pages/Big-Data-and-Data-Science-Workshop.aspx