SlideShare uma empresa Scribd logo
1 de 18
Green Shoots:
RDM Pilot at Imperial College London
Ian McArdle
Head of Research Systems & Information
i.mcardle@imperial.ac.uk
Torsten Reimer
Project Manager: Open Access & RDM
t.reimer@imperial.ac.uk
Presenting projects by: M. Bearpark & C. Fare; G. Thomas, S. Butcher & C. Tomlinson; M. Mueller;
H. S. Rzepa, M. J. Harvey, N. Mason & A. Mclean; G. Gorman, C. T. Jacobs & A. Avdis; N. Jones
www.imperial.ac.uk/researchsupport/rdm/policy/greenshoots
IDCC15, 10th February 2015
Imperial College London
• Seven London campuses
• Four Faculties: Engineering,
Medicine, Natural Sciences and
Business School
• Ranked 2nd in world
(QS University Ranking)
• Net income (2014): £855m, incl. £351m research grants and contracts
• ~15,000 students, ~7,200 staff, incl. ~3,700 academic & research staff
• Staff publish ~10,000 scholarly articles per year
http://www.imperial.ac.uk/
College Position Statement
“Imperial College London is committed to promoting the
highest standards of academic research, including
excellence in research data management. This includes
a robust digital curation infrastructure that supports open
data access and protects confidential data.
Imperial acknowledges legal, ethical and commercial
constraints on data sharing and the need to preserve the
academic entitlement to publication.”
- Approved by Provost Board, publicised via Staff Briefing
Investing in RDM
“Green Shoots” scheme is born
So where, specifically, should the College invest?
Considering large research income and reputation, Imperial cannot
afford to “get it wrong”
College acknowledges that excellence in RDM will require
significant investment and academic engagement
“Green Shoots” Funding - £100K Investment
What did we want?
• Academically-driven projects to
demonstrate best practice in
RDM
• Specifically frameworks /
prototypes that would comply
with funder policies and College
position
• Frameworks could be based
either on original ideas or
integrating existing solutions into
the research process
• Projects that supported Open
Innovation and open access for
data
What did we hope to achieve?
• Encourage a “bottoms up”
approach to maximise use of
local early adopters and
innovators
• Generate solutions that could be
grown to support RDM more
widely
• Demonstrate that innovative,
academically-driven, beneficial
RDM is possible and to stimulate
this further
• Advice concerning how Imperial
should proceed in supporting
RDM
FUNDING OPPORTUNITY:
Research Data Management
More Information: http://www.imperial.ac.uk/researchstrategy/funding
Contact: Ian McArdle i.mcardle@imperial.ac.uk
Submission Deadline: Friday 28th March 2014
Funding is available for academically-driven projects to identify
and generate exemplars of best practice in Research Data
Management (RDM), specifically frameworks and prototypes
that comply with key funder RDM policies and the College
position.
There is an expectation that solutions will support open access
for data and solutions that support Open Innovation are
strongly encouraged.
Funded Projects
• Haystack – A Computational Molecular Data Notebook
• M. Bearpark & C. Fare
• The Imperial College Tissue Bank: A Searchable Catalogue for Tissues, Research
Projects and Data Outcomes
• G. Thomas, S. Butcher & C. Tomlinson
• Integrated Rule-Based Data Management System for Genome Sequencing Data
• M. Mueller
• Research Data Management in Computational and Experimental Molecular
Science
• H. S. Rzepa, M. J. Harvey, N. Mason & A. Mclean
• Research Data Management: Where Software Meets Data
• G. Gorman, C. T. Jacobs & A. Avdis
• Research Data Management: Placing [Time Series] Data in its Context
• N. Jones
Haystack – A Computational Molecular Data Notebook
M. Bearpark & C. Fare
Idea
• Extend a working prototype of a computational chemical IPython notebook
making it available for all on github
Achievements
• Installation is now much simplified
• A tree document structure has been implemented
• Calculations using mainstream computational chemistry software can be set
up
• Calculations can be submitted to run on a high-performance computing cluster
• Data from completed calculations can be retrieved and visualised
RDM Benefits
• Enables computational molecular researchers to easily share a curated subset
of their results and document how those results were generated
More Information
• http://github.com/clyde-fare/cc_notebook
Imperial College Tissue Bank: A Searchable Catalogue for Tissues,
Research Projects and Data Outcomes
G. Thomas, S. Butcher & C. Tomlinson
Idea
• Extend the ICH tissue bank infrastructure to accept and catalogue research data
alongside the collection of 60,000 physical tissues specimens and donor records
Achievements
• A tool to automatically exchange data with the National Cancer Registry was built,
updating patient outcome data where known
• A pipeline to transfer summary sequencing data and metadata into the tissue bank
and a UI to view this information
• Prototyped a means for tracking location of associated raw sequencing data for
future development
• Began to investigate means to link publications back to associated tissue samples
RDM Benefits
• Enhances existing datasets and enables their reuse to maximise the benefits
gained from each tissue sample
More Information
• http://www.imperial.ac.uk/tissuebank/
Integrated Rule-Based Data Management System for Genome
Sequencing Data
M. Mueller
Idea
• Set up a data management system for the DNA sequencing service that will integrate with
existing central Imperial HPC infrastructure for processing, analysis and dissemination of raw
data and analysis results
Achievements
• See system on following slide
• iRODS-based system was implemented that:
• 1 – Transfers data from sequencer to HPC Service (different campus)
• 2 – Data are reformatted and split by sample and project and a quality report generated
• 3 – Reads are mapped to a reference genome, reformatting again, reducing file size
• 4 – Further compression achieved via compression algorithm
• 5 – Data transferred to a webserver and made available for download
• Overcame concerns over authentication by excluding the HPC storage from iRODS
RDM Benefits
• A robust infrastructure is now in place to effectively manage large volumes of complex
sequencing data
• The data are being made publicly available for re-use of this expensive resource
More Information
• http://www.imperial.ac.uk/genomicsfacility/informatics/
Integrated Rule-Based Data Management System for Genome
Sequencing Data
M. Mueller
Research Data Management in Computational and Experimental
Molecular Science
H. S. Rzepa, M. J. Harvey, N. Mason & A. Mclean
Idea
• Address sustainability and scalability of a hub interfacing electronic lab notebooks with
HPC resources and digital data repositories
Achievements
• Produced an installer package to allow reuse of uportal DSpace front end
• Enhanced metadata in local repository to make it compliant with DataCite specifications –
all repository content automatically receives a DOI
• Integrated ORCID into their solution
• Developed a procedure using DOIs for directly retrieving data from a digital repository and
displaying it using Javascript components
• Curated 170,000 datasets from Cambridge to Imperial, adding standards-based metadata
RDM Benefits
• Molecular data can be referenced more robustly with persistent identifiers – step forward in
data citation
More Information
• http://doi.org/10042/a3v1w
Research Data Management in Computational and Experimental
Molecular Science
H. S. Rzepa, M. J. Harvey, N. Mason & A. Mclean
Research Data Management: Where Software Meets Data
G. Gorman, C. T. Jacobs & A. Avdis
Idea
• Integrating research data management into the research workflow so that data and software can
be curated at the push of a button using Figshare and Git
Achievements
• Developed and released an open source software library: PyRDM
• Automatically transfers software source code (stored under Git control) and data to Figshare
• Figshare generates a DOI for that code version and the data
• Metadata including author details and cross-referencing between code and data are uploaded
automatically
• Hoping for ORCID authentication via Figshare API to be added
• PyRDM was integrated into the Fluidity computational fluid dynamics code
• DOIs minted are stored in Fluidity to improve data provenance and allow a new revision of the
repository to be created if the data are updated at a later stage
RDM Benefits
• Research data published in line with funder expectations
• The DOI for a specific code version enables better recomputability of data
• Automated metadata generation reduces academic burden
More Information
• http://github.com/pyrdm http://dx.doi.org/10.5334/jors.bj www.fluidity-project.org
Research Data Management: Placing [Time Series] Data in its Context
N. Jones
Idea
• Provide a platform and technology which automatically connects researchers
through their time-series data, models and analysis methods
Achievements
• Online interdisciplinary collection of time-series data and time-series analysis code
• Functionality to automatically profile time series
• Functionality to automatically profile time series algorithms
• Functionality to use these profiles to place a user’s work in the context of others
RDM Benefits
• Incentivises data sharing by allowing data comparison – increases discoverability of
an academic’s data plus increases likelihood of finding other relevant data
• Resource also available to general public
More Information
• http://www.comp-engine.org/timeseries/
Research Data Management: Placing [Time Series] Data in its Context
N. Jones
Overall Conclusions
Good data curation is HARD and EXPENSIVE
Development of sustainable research software is also HARD and EXPENSIVE
Data citation is
important
Immediate
incentives help
APIs useful
preferably open
Auto-generation
of metadata
E lab books
seem useful
Clinical data is a
minefield
Nucleus of an RDM community
at Imperial
Ideas to consider for wider deployment for
cross-College benefit
Thanks and Questions
Review of applications:
• Kevin Ashley, DCC Director
Green Shoots academics:
• M. Bearpark & C. Fare
• G. Thomas, S. Butcher & C. Tomlinson
• M. Mueller
• H. S. Rzepa, M. J. Harvey, N. Mason & A. Mclean
• G. Gorman, C. T. Jacobs & A. Avdis
• N. Jones
Provision of funds:
• Imperial Vice-Provost Advisory Group: Research

Mais conteúdo relacionado

Mais procurados

Introducing ORCID at Imperial College London
Introducing ORCID at Imperial College LondonIntroducing ORCID at Imperial College London
Introducing ORCID at Imperial College LondonTorsten Reimer
 
UKSG Conference 2016 Breakout Session - Jisc open access services to support ...
UKSG Conference 2016 Breakout Session - Jisc open access services to support ...UKSG Conference 2016 Breakout Session - Jisc open access services to support ...
UKSG Conference 2016 Breakout Session - Jisc open access services to support ...UKSG: connecting the knowledge community
 
Supporting the development of a national Research Data Discovery Service – a ...
Supporting the development of a national Research Data Discovery Service – a ...Supporting the development of a national Research Data Discovery Service – a ...
Supporting the development of a national Research Data Discovery Service – a ...EDINA, University of Edinburgh
 
Finalrevc
FinalrevcFinalrevc
FinalrevcSUNCAT
 
Jisc Publications Router: Delivering Open Access Content to Institutions
Jisc Publications Router: Delivering Open Access Content to InstitutionsJisc Publications Router: Delivering Open Access Content to Institutions
Jisc Publications Router: Delivering Open Access Content to InstitutionsEDINA, University of Edinburgh
 
Common ground: A view of the open education landscape in Scotland
Common ground: A view of the open education landscape in Scotland Common ground: A view of the open education landscape in Scotland
Common ground: A view of the open education landscape in Scotland Lorna Campbell
 
Preparing for the UK Research Data Registry and Discovery Service
Preparing for the UK Research Data Registry and Discovery ServicePreparing for the UK Research Data Registry and Discovery Service
Preparing for the UK Research Data Registry and Discovery ServiceRepository Fringe
 
Introduction to the University Data Library and national data services
Introduction to the University Data Library and national data servicesIntroduction to the University Data Library and national data services
Introduction to the University Data Library and national data servicesEDINA, University of Edinburgh
 
DIY Research Data Management Training Kit for Librarians
DIY Research Data Management Training Kit for LibrariansDIY Research Data Management Training Kit for Librarians
DIY Research Data Management Training Kit for LibrariansEDINA, University of Edinburgh
 
Optimising Resources to develop a strategic approach to OA
Optimising Resources to develop a strategic approach to OAOptimising Resources to develop a strategic approach to OA
Optimising Resources to develop a strategic approach to OARepository Fringe
 
Implementing Open Access – BU and UCL
Implementing Open Access – BU and UCLImplementing Open Access – BU and UCL
Implementing Open Access – BU and UCLRepository Fringe
 
On being a cog rather than inventing the wheel: Edinburgh DataShare as a key ...
On being a cog rather than inventing the wheel: Edinburgh DataShare as a key ...On being a cog rather than inventing the wheel: Edinburgh DataShare as a key ...
On being a cog rather than inventing the wheel: Edinburgh DataShare as a key ...EDINA, University of Edinburgh
 
Managing active research in the University of Edinburgh
Managing active research in the University of EdinburghManaging active research in the University of Edinburgh
Managing active research in the University of EdinburghRobin Rice
 

Mais procurados (20)

Introducing ORCID at Imperial College London
Introducing ORCID at Imperial College LondonIntroducing ORCID at Imperial College London
Introducing ORCID at Imperial College London
 
Ppls mvm2
Ppls mvm2Ppls mvm2
Ppls mvm2
 
Introduction to the New SUNCAT Interface
Introduction to the New SUNCAT InterfaceIntroduction to the New SUNCAT Interface
Introduction to the New SUNCAT Interface
 
Repositories Update (UK)
Repositories Update (UK) Repositories Update (UK)
Repositories Update (UK)
 
PEPRS: Recording The Extent Preserved
PEPRS: Recording The Extent PreservedPEPRS: Recording The Extent Preserved
PEPRS: Recording The Extent Preserved
 
UKLA Content Development
UKLA Content DevelopmentUKLA Content Development
UKLA Content Development
 
UKSG Conference 2016 Breakout Session - Jisc open access services to support ...
UKSG Conference 2016 Breakout Session - Jisc open access services to support ...UKSG Conference 2016 Breakout Session - Jisc open access services to support ...
UKSG Conference 2016 Breakout Session - Jisc open access services to support ...
 
Supporting the development of a national Research Data Discovery Service – a ...
Supporting the development of a national Research Data Discovery Service – a ...Supporting the development of a national Research Data Discovery Service – a ...
Supporting the development of a national Research Data Discovery Service – a ...
 
RDM through a UK lens - New Roles for Librarians?
RDM through a UK lens - New Roles for Librarians? RDM through a UK lens - New Roles for Librarians?
RDM through a UK lens - New Roles for Librarians?
 
Finalrevc
FinalrevcFinalrevc
Finalrevc
 
Jisc Publications Router: Delivering Open Access Content to Institutions
Jisc Publications Router: Delivering Open Access Content to InstitutionsJisc Publications Router: Delivering Open Access Content to Institutions
Jisc Publications Router: Delivering Open Access Content to Institutions
 
Common ground: A view of the open education landscape in Scotland
Common ground: A view of the open education landscape in Scotland Common ground: A view of the open education landscape in Scotland
Common ground: A view of the open education landscape in Scotland
 
Preparing for the UK Research Data Registry and Discovery Service
Preparing for the UK Research Data Registry and Discovery ServicePreparing for the UK Research Data Registry and Discovery Service
Preparing for the UK Research Data Registry and Discovery Service
 
Introduction to the University Data Library and national data services
Introduction to the University Data Library and national data servicesIntroduction to the University Data Library and national data services
Introduction to the University Data Library and national data services
 
DIY Research Data Management Training Kit for Librarians
DIY Research Data Management Training Kit for LibrariansDIY Research Data Management Training Kit for Librarians
DIY Research Data Management Training Kit for Librarians
 
Optimising Resources to develop a strategic approach to OA
Optimising Resources to develop a strategic approach to OAOptimising Resources to develop a strategic approach to OA
Optimising Resources to develop a strategic approach to OA
 
Implementing Open Access – BU and UCL
Implementing Open Access – BU and UCLImplementing Open Access – BU and UCL
Implementing Open Access – BU and UCL
 
Looking After Your Data: RDM @ Edinburgh
Looking After Your Data: RDM @ EdinburghLooking After Your Data: RDM @ Edinburgh
Looking After Your Data: RDM @ Edinburgh
 
On being a cog rather than inventing the wheel: Edinburgh DataShare as a key ...
On being a cog rather than inventing the wheel: Edinburgh DataShare as a key ...On being a cog rather than inventing the wheel: Edinburgh DataShare as a key ...
On being a cog rather than inventing the wheel: Edinburgh DataShare as a key ...
 
Managing active research in the University of Edinburgh
Managing active research in the University of EdinburghManaging active research in the University of Edinburgh
Managing active research in the University of Edinburgh
 

Destaque

NAGARA: SRB and iRODS
NAGARA: SRB and iRODSNAGARA: SRB and iRODS
NAGARA: SRB and iRODSMark Conrad
 
iRODS/Dataverse Project by Jonathan Crabtree
iRODS/Dataverse Project by Jonathan CrabtreeiRODS/Dataverse Project by Jonathan Crabtree
iRODS/Dataverse Project by Jonathan Crabtreedatascienceiqss
 
Research Data Management en bibliotheken
Research Data Management en bibliothekenResearch Data Management en bibliotheken
Research Data Management en bibliothekenSaskia Scheltjens
 
Data Management for Grown Ups
Data Management for Grown UpsData Management for Grown Ups
Data Management for Grown UpsAll Things Open
 
iRODS User Group Meeting 2016 - MUMC+
iRODS User Group Meeting 2016 - MUMC+iRODS User Group Meeting 2016 - MUMC+
iRODS User Group Meeting 2016 - MUMC+Maarten Coonen
 
iRODS Rule Language Cheat Sheet
iRODS Rule Language Cheat SheetiRODS Rule Language Cheat Sheet
iRODS Rule Language Cheat SheetSamuel Lampa
 
Access HDF-EOS data with OGC Web Coverage Service - Earth Observation Applica...
Access HDF-EOS data with OGC Web Coverage Service - Earth Observation Applica...Access HDF-EOS data with OGC Web Coverage Service - Earth Observation Applica...
Access HDF-EOS data with OGC Web Coverage Service - Earth Observation Applica...The HDF-EOS Tools and Information Center
 
Private Cloud Architecture
Private Cloud ArchitecturePrivate Cloud Architecture
Private Cloud ArchitectureDerek Keats
 
File management ppt
File management pptFile management ppt
File management pptmarotti
 
I rods분석(20170313,01,김선태)
I rods분석(20170313,01,김선태)I rods분석(20170313,01,김선태)
I rods분석(20170313,01,김선태)Suntae Kim
 
White Paper: Life Sciences at RENCI, Big Data IT to Manage, Decipher and Info...
White Paper: Life Sciences at RENCI, Big Data IT to Manage, Decipher and Info...White Paper: Life Sciences at RENCI, Big Data IT to Manage, Decipher and Info...
White Paper: Life Sciences at RENCI, Big Data IT to Manage, Decipher and Info...EMC
 
Operating Systems - File Management
Operating Systems -  File ManagementOperating Systems -  File Management
Operating Systems - File ManagementDamian T. Gordon
 

Destaque (16)

NAGARA: SRB and iRODS
NAGARA: SRB and iRODSNAGARA: SRB and iRODS
NAGARA: SRB and iRODS
 
iRODS/Dataverse Project by Jonathan Crabtree
iRODS/Dataverse Project by Jonathan CrabtreeiRODS/Dataverse Project by Jonathan Crabtree
iRODS/Dataverse Project by Jonathan Crabtree
 
Research Data Management en bibliotheken
Research Data Management en bibliothekenResearch Data Management en bibliotheken
Research Data Management en bibliotheken
 
Data Management for Grown Ups
Data Management for Grown UpsData Management for Grown Ups
Data Management for Grown Ups
 
UDT
UDTUDT
UDT
 
ODSC and iRODS
ODSC and iRODSODSC and iRODS
ODSC and iRODS
 
iRODS User Group Meeting 2016 - MUMC+
iRODS User Group Meeting 2016 - MUMC+iRODS User Group Meeting 2016 - MUMC+
iRODS User Group Meeting 2016 - MUMC+
 
iRODS Rule Language Cheat Sheet
iRODS Rule Language Cheat SheetiRODS Rule Language Cheat Sheet
iRODS Rule Language Cheat Sheet
 
HDF5 iRODS
HDF5 iRODSHDF5 iRODS
HDF5 iRODS
 
Access HDF-EOS data with OGC Web Coverage Service - Earth Observation Applica...
Access HDF-EOS data with OGC Web Coverage Service - Earth Observation Applica...Access HDF-EOS data with OGC Web Coverage Service - Earth Observation Applica...
Access HDF-EOS data with OGC Web Coverage Service - Earth Observation Applica...
 
iRODS: Interoperability in Data Management
iRODS: Interoperability in Data ManagementiRODS: Interoperability in Data Management
iRODS: Interoperability in Data Management
 
Private Cloud Architecture
Private Cloud ArchitecturePrivate Cloud Architecture
Private Cloud Architecture
 
File management ppt
File management pptFile management ppt
File management ppt
 
I rods분석(20170313,01,김선태)
I rods분석(20170313,01,김선태)I rods분석(20170313,01,김선태)
I rods분석(20170313,01,김선태)
 
White Paper: Life Sciences at RENCI, Big Data IT to Manage, Decipher and Info...
White Paper: Life Sciences at RENCI, Big Data IT to Manage, Decipher and Info...White Paper: Life Sciences at RENCI, Big Data IT to Manage, Decipher and Info...
White Paper: Life Sciences at RENCI, Big Data IT to Manage, Decipher and Info...
 
Operating Systems - File Management
Operating Systems -  File ManagementOperating Systems -  File Management
Operating Systems - File Management
 

Semelhante a Green Shoots: Research Data Management Pilot at Imperial College London

Research Data Management at Imperial College London
Research Data Management at Imperial College LondonResearch Data Management at Imperial College London
Research Data Management at Imperial College LondonSarah Anna Stewart
 
PIDs, Data and Software: How Libraries Can Support Researchers in an Evolving...
PIDs, Data and Software: How Libraries Can Support Researchers in an Evolving...PIDs, Data and Software: How Libraries Can Support Researchers in an Evolving...
PIDs, Data and Software: How Libraries Can Support Researchers in an Evolving...Sarah Anna Stewart
 
ChemSpider – disseminating data and enabling an abundance of chemistry platforms
ChemSpider – disseminating data and enabling an abundance of chemistry platformsChemSpider – disseminating data and enabling an abundance of chemistry platforms
ChemSpider – disseminating data and enabling an abundance of chemistry platformsKen Karapetyan
 
Birgit Plietzsch “RDM within research computing support” SALCTG June 2013
Birgit Plietzsch “RDM within research computing support” SALCTG June 2013Birgit Plietzsch “RDM within research computing support” SALCTG June 2013
Birgit Plietzsch “RDM within research computing support” SALCTG June 2013SALCTG
 
RDM Roadmap to the Future, or: Lords and Ladies of the Data
RDM Roadmap to the Future, or: Lords and Ladies of the DataRDM Roadmap to the Future, or: Lords and Ladies of the Data
RDM Roadmap to the Future, or: Lords and Ladies of the DataRobin Rice
 
Supporting the development of a national Research Data Discovery Service - A ...
Supporting the development of a national Research Data Discovery Service - A ...Supporting the development of a national Research Data Discovery Service - A ...
Supporting the development of a national Research Data Discovery Service - A ...Historic Environment Scotland
 
High Performance Data Analytics and a Java Grande Run Time
High Performance Data Analytics and a Java Grande Run TimeHigh Performance Data Analytics and a Java Grande Run Time
High Performance Data Analytics and a Java Grande Run TimeGeoffrey Fox
 
Research Data Services @ Edinburgh: MANTRA & Edinburgh DataShare
Research Data Services @ Edinburgh: MANTRA & Edinburgh DataShareResearch Data Services @ Edinburgh: MANTRA & Edinburgh DataShare
Research Data Services @ Edinburgh: MANTRA & Edinburgh DataShareHistoric Environment Scotland
 
Facing the data challenge: Developing data policy & services
Facing the data challenge: Developing data policy & servicesFacing the data challenge: Developing data policy & services
Facing the data challenge: Developing data policy & servicesMarieke Guy
 
Repository Fringe 2015 - Jisc RDM Session, Linda Naughton, Jisc
Repository Fringe 2015 - Jisc RDM Session, Linda Naughton, JiscRepository Fringe 2015 - Jisc RDM Session, Linda Naughton, Jisc
Repository Fringe 2015 - Jisc RDM Session, Linda Naughton, JiscRepository Fringe
 
Building data networks: exploring trust and interoperability between authoris...
Building data networks: exploring trust and interoperability between authoris...Building data networks: exploring trust and interoperability between authoris...
Building data networks: exploring trust and interoperability between authoris...Repository Fringe
 
Engaging with students and researchers: the case of the social sciences
Engaging with students and researchers: the case of the social sciencesEngaging with students and researchers: the case of the social sciences
Engaging with students and researchers: the case of the social sciencesLouise Corti
 
Graham Pryor
Graham PryorGraham Pryor
Graham PryorEduserv
 
Scottish Digital Library Consortium Meeting: Edinburgh DataShare
Scottish Digital Library Consortium Meeting: Edinburgh DataShareScottish Digital Library Consortium Meeting: Edinburgh DataShare
Scottish Digital Library Consortium Meeting: Edinburgh DataShareRobin Rice
 

Semelhante a Green Shoots: Research Data Management Pilot at Imperial College London (20)

Research Data Management at Imperial College London
Research Data Management at Imperial College LondonResearch Data Management at Imperial College London
Research Data Management at Imperial College London
 
PIDs, Data and Software: How Libraries Can Support Researchers in an Evolving...
PIDs, Data and Software: How Libraries Can Support Researchers in an Evolving...PIDs, Data and Software: How Libraries Can Support Researchers in an Evolving...
PIDs, Data and Software: How Libraries Can Support Researchers in an Evolving...
 
RDM & ELNs @ Edinburgh
RDM & ELNs @ EdinburghRDM & ELNs @ Edinburgh
RDM & ELNs @ Edinburgh
 
ChemSpider – disseminating data and enabling an abundance of chemistry platforms
ChemSpider – disseminating data and enabling an abundance of chemistry platformsChemSpider – disseminating data and enabling an abundance of chemistry platforms
ChemSpider – disseminating data and enabling an abundance of chemistry platforms
 
Birgit Plietzsch “RDM within research computing support” SALCTG June 2013
Birgit Plietzsch “RDM within research computing support” SALCTG June 2013Birgit Plietzsch “RDM within research computing support” SALCTG June 2013
Birgit Plietzsch “RDM within research computing support” SALCTG June 2013
 
RDM Roadmap to the Future, or: Lords and Ladies of the Data
RDM Roadmap to the Future, or: Lords and Ladies of the DataRDM Roadmap to the Future, or: Lords and Ladies of the Data
RDM Roadmap to the Future, or: Lords and Ladies of the Data
 
RDM@Edinburgh
RDM@EdinburghRDM@Edinburgh
RDM@Edinburgh
 
Supporting the development of a national Research Data Discovery Service - A ...
Supporting the development of a national Research Data Discovery Service - A ...Supporting the development of a national Research Data Discovery Service - A ...
Supporting the development of a national Research Data Discovery Service - A ...
 
RDM@Edinburgh
RDM@EdinburghRDM@Edinburgh
RDM@Edinburgh
 
Research Data Management: Why is it important?
Research Data Management: Why is it  important?Research Data Management: Why is it  important?
Research Data Management: Why is it important?
 
High Performance Data Analytics and a Java Grande Run Time
High Performance Data Analytics and a Java Grande Run TimeHigh Performance Data Analytics and a Java Grande Run Time
High Performance Data Analytics and a Java Grande Run Time
 
Research Data Services @ Edinburgh: MANTRA & Edinburgh DataShare
Research Data Services @ Edinburgh: MANTRA & Edinburgh DataShareResearch Data Services @ Edinburgh: MANTRA & Edinburgh DataShare
Research Data Services @ Edinburgh: MANTRA & Edinburgh DataShare
 
Facing the data challenge: Developing data policy & services
Facing the data challenge: Developing data policy & servicesFacing the data challenge: Developing data policy & services
Facing the data challenge: Developing data policy & services
 
Repository Fringe 2015 - Jisc RDM Session, Linda Naughton, Jisc
Repository Fringe 2015 - Jisc RDM Session, Linda Naughton, JiscRepository Fringe 2015 - Jisc RDM Session, Linda Naughton, Jisc
Repository Fringe 2015 - Jisc RDM Session, Linda Naughton, Jisc
 
Building data networks: exploring trust and interoperability between authoris...
Building data networks: exploring trust and interoperability between authoris...Building data networks: exploring trust and interoperability between authoris...
Building data networks: exploring trust and interoperability between authoris...
 
Engaging with students and researchers: the case of the social sciences
Engaging with students and researchers: the case of the social sciencesEngaging with students and researchers: the case of the social sciences
Engaging with students and researchers: the case of the social sciences
 
Introduction to Research Data Management
Introduction to Research Data ManagementIntroduction to Research Data Management
Introduction to Research Data Management
 
RDM Programme @ Edinburgh
RDM Programme @ Edinburgh RDM Programme @ Edinburgh
RDM Programme @ Edinburgh
 
Graham Pryor
Graham PryorGraham Pryor
Graham Pryor
 
Scottish Digital Library Consortium Meeting: Edinburgh DataShare
Scottish Digital Library Consortium Meeting: Edinburgh DataShareScottish Digital Library Consortium Meeting: Edinburgh DataShare
Scottish Digital Library Consortium Meeting: Edinburgh DataShare
 

Mais de Torsten Reimer

Does anybody care about digital preservation? Digital preservation from a per...
Does anybody care about digital preservation? Digital preservation from a per...Does anybody care about digital preservation? Digital preservation from a per...
Does anybody care about digital preservation? Digital preservation from a per...Torsten Reimer
 
A Manifesto for the Digital Shift in Research Libraries
A Manifesto for the Digital Shift in Research LibrariesA Manifesto for the Digital Shift in Research Libraries
A Manifesto for the Digital Shift in Research LibrariesTorsten Reimer
 
Researching researchers Delivering a systematic user research programme in a ...
Researching researchers Delivering a systematic user research programme in a ...Researching researchers Delivering a systematic user research programme in a ...
Researching researchers Delivering a systematic user research programme in a ...Torsten Reimer
 
The once and future library: will there be, and what might a research library...
The once and future library: will there be, and what might a research library...The once and future library: will there be, and what might a research library...
The once and future library: will there be, and what might a research library...Torsten Reimer
 
For repositories to succeed they have to end. Reflections on (not just) the U...
For repositories to succeed they have to end. Reflections on (not just) the U...For repositories to succeed they have to end. Reflections on (not just) the U...
For repositories to succeed they have to end. Reflections on (not just) the U...Torsten Reimer
 
Making ‘Everything Available’ – Transforming the (online) services and experi...
Making ‘Everything Available’ – Transforming the (online) services and experi...Making ‘Everything Available’ – Transforming the (online) services and experi...
Making ‘Everything Available’ – Transforming the (online) services and experi...Torsten Reimer
 
Imperial College ORCID project
Imperial College ORCID projectImperial College ORCID project
Imperial College ORCID projectTorsten Reimer
 
ORCID - A University Perspective
ORCID - A University PerspectiveORCID - A University Perspective
ORCID - A University PerspectiveTorsten Reimer
 

Mais de Torsten Reimer (8)

Does anybody care about digital preservation? Digital preservation from a per...
Does anybody care about digital preservation? Digital preservation from a per...Does anybody care about digital preservation? Digital preservation from a per...
Does anybody care about digital preservation? Digital preservation from a per...
 
A Manifesto for the Digital Shift in Research Libraries
A Manifesto for the Digital Shift in Research LibrariesA Manifesto for the Digital Shift in Research Libraries
A Manifesto for the Digital Shift in Research Libraries
 
Researching researchers Delivering a systematic user research programme in a ...
Researching researchers Delivering a systematic user research programme in a ...Researching researchers Delivering a systematic user research programme in a ...
Researching researchers Delivering a systematic user research programme in a ...
 
The once and future library: will there be, and what might a research library...
The once and future library: will there be, and what might a research library...The once and future library: will there be, and what might a research library...
The once and future library: will there be, and what might a research library...
 
For repositories to succeed they have to end. Reflections on (not just) the U...
For repositories to succeed they have to end. Reflections on (not just) the U...For repositories to succeed they have to end. Reflections on (not just) the U...
For repositories to succeed they have to end. Reflections on (not just) the U...
 
Making ‘Everything Available’ – Transforming the (online) services and experi...
Making ‘Everything Available’ – Transforming the (online) services and experi...Making ‘Everything Available’ – Transforming the (online) services and experi...
Making ‘Everything Available’ – Transforming the (online) services and experi...
 
Imperial College ORCID project
Imperial College ORCID projectImperial College ORCID project
Imperial College ORCID project
 
ORCID - A University Perspective
ORCID - A University PerspectiveORCID - A University Perspective
ORCID - A University Perspective
 

Último

ECONOMIC CONTEXT - PAPER 1 Q3: NEWSPAPERS.pptx
ECONOMIC CONTEXT - PAPER 1 Q3: NEWSPAPERS.pptxECONOMIC CONTEXT - PAPER 1 Q3: NEWSPAPERS.pptx
ECONOMIC CONTEXT - PAPER 1 Q3: NEWSPAPERS.pptxiammrhaywood
 
Inclusivity Essentials_ Creating Accessible Websites for Nonprofits .pdf
Inclusivity Essentials_ Creating Accessible Websites for Nonprofits .pdfInclusivity Essentials_ Creating Accessible Websites for Nonprofits .pdf
Inclusivity Essentials_ Creating Accessible Websites for Nonprofits .pdfTechSoup
 
THEORIES OF ORGANIZATION-PUBLIC ADMINISTRATION
THEORIES OF ORGANIZATION-PUBLIC ADMINISTRATIONTHEORIES OF ORGANIZATION-PUBLIC ADMINISTRATION
THEORIES OF ORGANIZATION-PUBLIC ADMINISTRATIONHumphrey A Beña
 
INTRODUCTION TO CATHOLIC CHRISTOLOGY.pptx
INTRODUCTION TO CATHOLIC CHRISTOLOGY.pptxINTRODUCTION TO CATHOLIC CHRISTOLOGY.pptx
INTRODUCTION TO CATHOLIC CHRISTOLOGY.pptxHumphrey A Beña
 
Proudly South Africa powerpoint Thorisha.pptx
Proudly South Africa powerpoint Thorisha.pptxProudly South Africa powerpoint Thorisha.pptx
Proudly South Africa powerpoint Thorisha.pptxthorishapillay1
 
Computed Fields and api Depends in the Odoo 17
Computed Fields and api Depends in the Odoo 17Computed Fields and api Depends in the Odoo 17
Computed Fields and api Depends in the Odoo 17Celine George
 
Karra SKD Conference Presentation Revised.pptx
Karra SKD Conference Presentation Revised.pptxKarra SKD Conference Presentation Revised.pptx
Karra SKD Conference Presentation Revised.pptxAshokKarra1
 
ISYU TUNGKOL SA SEKSWLADIDA (ISSUE ABOUT SEXUALITY
ISYU TUNGKOL SA SEKSWLADIDA (ISSUE ABOUT SEXUALITYISYU TUNGKOL SA SEKSWLADIDA (ISSUE ABOUT SEXUALITY
ISYU TUNGKOL SA SEKSWLADIDA (ISSUE ABOUT SEXUALITYKayeClaireEstoconing
 
GRADE 4 - SUMMATIVE TEST QUARTER 4 ALL SUBJECTS
GRADE 4 - SUMMATIVE TEST QUARTER 4 ALL SUBJECTSGRADE 4 - SUMMATIVE TEST QUARTER 4 ALL SUBJECTS
GRADE 4 - SUMMATIVE TEST QUARTER 4 ALL SUBJECTSJoshuaGantuangco2
 
Like-prefer-love -hate+verb+ing & silent letters & citizenship text.pdf
Like-prefer-love -hate+verb+ing & silent letters & citizenship text.pdfLike-prefer-love -hate+verb+ing & silent letters & citizenship text.pdf
Like-prefer-love -hate+verb+ing & silent letters & citizenship text.pdfMr Bounab Samir
 
Science 7 Quarter 4 Module 2: Natural Resources.pptx
Science 7 Quarter 4 Module 2: Natural Resources.pptxScience 7 Quarter 4 Module 2: Natural Resources.pptx
Science 7 Quarter 4 Module 2: Natural Resources.pptxMaryGraceBautista27
 
Q4 English4 Week3 PPT Melcnmg-based.pptx
Q4 English4 Week3 PPT Melcnmg-based.pptxQ4 English4 Week3 PPT Melcnmg-based.pptx
Q4 English4 Week3 PPT Melcnmg-based.pptxnelietumpap1
 
call girls in Kamla Market (DELHI) 🔝 >༒9953330565🔝 genuine Escort Service 🔝✔️✔️
call girls in Kamla Market (DELHI) 🔝 >༒9953330565🔝 genuine Escort Service 🔝✔️✔️call girls in Kamla Market (DELHI) 🔝 >༒9953330565🔝 genuine Escort Service 🔝✔️✔️
call girls in Kamla Market (DELHI) 🔝 >༒9953330565🔝 genuine Escort Service 🔝✔️✔️9953056974 Low Rate Call Girls In Saket, Delhi NCR
 
AMERICAN LANGUAGE HUB_Level2_Student'sBook_Answerkey.pdf
AMERICAN LANGUAGE HUB_Level2_Student'sBook_Answerkey.pdfAMERICAN LANGUAGE HUB_Level2_Student'sBook_Answerkey.pdf
AMERICAN LANGUAGE HUB_Level2_Student'sBook_Answerkey.pdfphamnguyenenglishnb
 
How to do quick user assign in kanban in Odoo 17 ERP
How to do quick user assign in kanban in Odoo 17 ERPHow to do quick user assign in kanban in Odoo 17 ERP
How to do quick user assign in kanban in Odoo 17 ERPCeline George
 
HỌC TỐT TIẾNG ANH 11 THEO CHƯƠNG TRÌNH GLOBAL SUCCESS ĐÁP ÁN CHI TIẾT - CẢ NĂ...
HỌC TỐT TIẾNG ANH 11 THEO CHƯƠNG TRÌNH GLOBAL SUCCESS ĐÁP ÁN CHI TIẾT - CẢ NĂ...HỌC TỐT TIẾNG ANH 11 THEO CHƯƠNG TRÌNH GLOBAL SUCCESS ĐÁP ÁN CHI TIẾT - CẢ NĂ...
HỌC TỐT TIẾNG ANH 11 THEO CHƯƠNG TRÌNH GLOBAL SUCCESS ĐÁP ÁN CHI TIẾT - CẢ NĂ...Nguyen Thanh Tu Collection
 
How to Add Barcode on PDF Report in Odoo 17
How to Add Barcode on PDF Report in Odoo 17How to Add Barcode on PDF Report in Odoo 17
How to Add Barcode on PDF Report in Odoo 17Celine George
 

Último (20)

ECONOMIC CONTEXT - PAPER 1 Q3: NEWSPAPERS.pptx
ECONOMIC CONTEXT - PAPER 1 Q3: NEWSPAPERS.pptxECONOMIC CONTEXT - PAPER 1 Q3: NEWSPAPERS.pptx
ECONOMIC CONTEXT - PAPER 1 Q3: NEWSPAPERS.pptx
 
Inclusivity Essentials_ Creating Accessible Websites for Nonprofits .pdf
Inclusivity Essentials_ Creating Accessible Websites for Nonprofits .pdfInclusivity Essentials_ Creating Accessible Websites for Nonprofits .pdf
Inclusivity Essentials_ Creating Accessible Websites for Nonprofits .pdf
 
THEORIES OF ORGANIZATION-PUBLIC ADMINISTRATION
THEORIES OF ORGANIZATION-PUBLIC ADMINISTRATIONTHEORIES OF ORGANIZATION-PUBLIC ADMINISTRATION
THEORIES OF ORGANIZATION-PUBLIC ADMINISTRATION
 
INTRODUCTION TO CATHOLIC CHRISTOLOGY.pptx
INTRODUCTION TO CATHOLIC CHRISTOLOGY.pptxINTRODUCTION TO CATHOLIC CHRISTOLOGY.pptx
INTRODUCTION TO CATHOLIC CHRISTOLOGY.pptx
 
Proudly South Africa powerpoint Thorisha.pptx
Proudly South Africa powerpoint Thorisha.pptxProudly South Africa powerpoint Thorisha.pptx
Proudly South Africa powerpoint Thorisha.pptx
 
Computed Fields and api Depends in the Odoo 17
Computed Fields and api Depends in the Odoo 17Computed Fields and api Depends in the Odoo 17
Computed Fields and api Depends in the Odoo 17
 
Karra SKD Conference Presentation Revised.pptx
Karra SKD Conference Presentation Revised.pptxKarra SKD Conference Presentation Revised.pptx
Karra SKD Conference Presentation Revised.pptx
 
ISYU TUNGKOL SA SEKSWLADIDA (ISSUE ABOUT SEXUALITY
ISYU TUNGKOL SA SEKSWLADIDA (ISSUE ABOUT SEXUALITYISYU TUNGKOL SA SEKSWLADIDA (ISSUE ABOUT SEXUALITY
ISYU TUNGKOL SA SEKSWLADIDA (ISSUE ABOUT SEXUALITY
 
GRADE 4 - SUMMATIVE TEST QUARTER 4 ALL SUBJECTS
GRADE 4 - SUMMATIVE TEST QUARTER 4 ALL SUBJECTSGRADE 4 - SUMMATIVE TEST QUARTER 4 ALL SUBJECTS
GRADE 4 - SUMMATIVE TEST QUARTER 4 ALL SUBJECTS
 
Like-prefer-love -hate+verb+ing & silent letters & citizenship text.pdf
Like-prefer-love -hate+verb+ing & silent letters & citizenship text.pdfLike-prefer-love -hate+verb+ing & silent letters & citizenship text.pdf
Like-prefer-love -hate+verb+ing & silent letters & citizenship text.pdf
 
YOUVE_GOT_EMAIL_PRELIMS_EL_DORADO_2024.pptx
YOUVE_GOT_EMAIL_PRELIMS_EL_DORADO_2024.pptxYOUVE_GOT_EMAIL_PRELIMS_EL_DORADO_2024.pptx
YOUVE_GOT_EMAIL_PRELIMS_EL_DORADO_2024.pptx
 
Science 7 Quarter 4 Module 2: Natural Resources.pptx
Science 7 Quarter 4 Module 2: Natural Resources.pptxScience 7 Quarter 4 Module 2: Natural Resources.pptx
Science 7 Quarter 4 Module 2: Natural Resources.pptx
 
Q4 English4 Week3 PPT Melcnmg-based.pptx
Q4 English4 Week3 PPT Melcnmg-based.pptxQ4 English4 Week3 PPT Melcnmg-based.pptx
Q4 English4 Week3 PPT Melcnmg-based.pptx
 
call girls in Kamla Market (DELHI) 🔝 >༒9953330565🔝 genuine Escort Service 🔝✔️✔️
call girls in Kamla Market (DELHI) 🔝 >༒9953330565🔝 genuine Escort Service 🔝✔️✔️call girls in Kamla Market (DELHI) 🔝 >༒9953330565🔝 genuine Escort Service 🔝✔️✔️
call girls in Kamla Market (DELHI) 🔝 >༒9953330565🔝 genuine Escort Service 🔝✔️✔️
 
AMERICAN LANGUAGE HUB_Level2_Student'sBook_Answerkey.pdf
AMERICAN LANGUAGE HUB_Level2_Student'sBook_Answerkey.pdfAMERICAN LANGUAGE HUB_Level2_Student'sBook_Answerkey.pdf
AMERICAN LANGUAGE HUB_Level2_Student'sBook_Answerkey.pdf
 
How to do quick user assign in kanban in Odoo 17 ERP
How to do quick user assign in kanban in Odoo 17 ERPHow to do quick user assign in kanban in Odoo 17 ERP
How to do quick user assign in kanban in Odoo 17 ERP
 
HỌC TỐT TIẾNG ANH 11 THEO CHƯƠNG TRÌNH GLOBAL SUCCESS ĐÁP ÁN CHI TIẾT - CẢ NĂ...
HỌC TỐT TIẾNG ANH 11 THEO CHƯƠNG TRÌNH GLOBAL SUCCESS ĐÁP ÁN CHI TIẾT - CẢ NĂ...HỌC TỐT TIẾNG ANH 11 THEO CHƯƠNG TRÌNH GLOBAL SUCCESS ĐÁP ÁN CHI TIẾT - CẢ NĂ...
HỌC TỐT TIẾNG ANH 11 THEO CHƯƠNG TRÌNH GLOBAL SUCCESS ĐÁP ÁN CHI TIẾT - CẢ NĂ...
 
YOUVE GOT EMAIL_FINALS_EL_DORADO_2024.pptx
YOUVE GOT EMAIL_FINALS_EL_DORADO_2024.pptxYOUVE GOT EMAIL_FINALS_EL_DORADO_2024.pptx
YOUVE GOT EMAIL_FINALS_EL_DORADO_2024.pptx
 
TataKelola dan KamSiber Kecerdasan Buatan v022.pdf
TataKelola dan KamSiber Kecerdasan Buatan v022.pdfTataKelola dan KamSiber Kecerdasan Buatan v022.pdf
TataKelola dan KamSiber Kecerdasan Buatan v022.pdf
 
How to Add Barcode on PDF Report in Odoo 17
How to Add Barcode on PDF Report in Odoo 17How to Add Barcode on PDF Report in Odoo 17
How to Add Barcode on PDF Report in Odoo 17
 

Green Shoots: Research Data Management Pilot at Imperial College London

  • 1. Green Shoots: RDM Pilot at Imperial College London Ian McArdle Head of Research Systems & Information i.mcardle@imperial.ac.uk Torsten Reimer Project Manager: Open Access & RDM t.reimer@imperial.ac.uk Presenting projects by: M. Bearpark & C. Fare; G. Thomas, S. Butcher & C. Tomlinson; M. Mueller; H. S. Rzepa, M. J. Harvey, N. Mason & A. Mclean; G. Gorman, C. T. Jacobs & A. Avdis; N. Jones www.imperial.ac.uk/researchsupport/rdm/policy/greenshoots IDCC15, 10th February 2015
  • 2. Imperial College London • Seven London campuses • Four Faculties: Engineering, Medicine, Natural Sciences and Business School • Ranked 2nd in world (QS University Ranking) • Net income (2014): £855m, incl. £351m research grants and contracts • ~15,000 students, ~7,200 staff, incl. ~3,700 academic & research staff • Staff publish ~10,000 scholarly articles per year http://www.imperial.ac.uk/
  • 3. College Position Statement “Imperial College London is committed to promoting the highest standards of academic research, including excellence in research data management. This includes a robust digital curation infrastructure that supports open data access and protects confidential data. Imperial acknowledges legal, ethical and commercial constraints on data sharing and the need to preserve the academic entitlement to publication.” - Approved by Provost Board, publicised via Staff Briefing
  • 4. Investing in RDM “Green Shoots” scheme is born So where, specifically, should the College invest? Considering large research income and reputation, Imperial cannot afford to “get it wrong” College acknowledges that excellence in RDM will require significant investment and academic engagement
  • 5. “Green Shoots” Funding - £100K Investment What did we want? • Academically-driven projects to demonstrate best practice in RDM • Specifically frameworks / prototypes that would comply with funder policies and College position • Frameworks could be based either on original ideas or integrating existing solutions into the research process • Projects that supported Open Innovation and open access for data What did we hope to achieve? • Encourage a “bottoms up” approach to maximise use of local early adopters and innovators • Generate solutions that could be grown to support RDM more widely • Demonstrate that innovative, academically-driven, beneficial RDM is possible and to stimulate this further • Advice concerning how Imperial should proceed in supporting RDM
  • 6. FUNDING OPPORTUNITY: Research Data Management More Information: http://www.imperial.ac.uk/researchstrategy/funding Contact: Ian McArdle i.mcardle@imperial.ac.uk Submission Deadline: Friday 28th March 2014 Funding is available for academically-driven projects to identify and generate exemplars of best practice in Research Data Management (RDM), specifically frameworks and prototypes that comply with key funder RDM policies and the College position. There is an expectation that solutions will support open access for data and solutions that support Open Innovation are strongly encouraged.
  • 7. Funded Projects • Haystack – A Computational Molecular Data Notebook • M. Bearpark & C. Fare • The Imperial College Tissue Bank: A Searchable Catalogue for Tissues, Research Projects and Data Outcomes • G. Thomas, S. Butcher & C. Tomlinson • Integrated Rule-Based Data Management System for Genome Sequencing Data • M. Mueller • Research Data Management in Computational and Experimental Molecular Science • H. S. Rzepa, M. J. Harvey, N. Mason & A. Mclean • Research Data Management: Where Software Meets Data • G. Gorman, C. T. Jacobs & A. Avdis • Research Data Management: Placing [Time Series] Data in its Context • N. Jones
  • 8. Haystack – A Computational Molecular Data Notebook M. Bearpark & C. Fare Idea • Extend a working prototype of a computational chemical IPython notebook making it available for all on github Achievements • Installation is now much simplified • A tree document structure has been implemented • Calculations using mainstream computational chemistry software can be set up • Calculations can be submitted to run on a high-performance computing cluster • Data from completed calculations can be retrieved and visualised RDM Benefits • Enables computational molecular researchers to easily share a curated subset of their results and document how those results were generated More Information • http://github.com/clyde-fare/cc_notebook
  • 9. Imperial College Tissue Bank: A Searchable Catalogue for Tissues, Research Projects and Data Outcomes G. Thomas, S. Butcher & C. Tomlinson Idea • Extend the ICH tissue bank infrastructure to accept and catalogue research data alongside the collection of 60,000 physical tissues specimens and donor records Achievements • A tool to automatically exchange data with the National Cancer Registry was built, updating patient outcome data where known • A pipeline to transfer summary sequencing data and metadata into the tissue bank and a UI to view this information • Prototyped a means for tracking location of associated raw sequencing data for future development • Began to investigate means to link publications back to associated tissue samples RDM Benefits • Enhances existing datasets and enables their reuse to maximise the benefits gained from each tissue sample More Information • http://www.imperial.ac.uk/tissuebank/
  • 10. Integrated Rule-Based Data Management System for Genome Sequencing Data M. Mueller Idea • Set up a data management system for the DNA sequencing service that will integrate with existing central Imperial HPC infrastructure for processing, analysis and dissemination of raw data and analysis results Achievements • See system on following slide • iRODS-based system was implemented that: • 1 – Transfers data from sequencer to HPC Service (different campus) • 2 – Data are reformatted and split by sample and project and a quality report generated • 3 – Reads are mapped to a reference genome, reformatting again, reducing file size • 4 – Further compression achieved via compression algorithm • 5 – Data transferred to a webserver and made available for download • Overcame concerns over authentication by excluding the HPC storage from iRODS RDM Benefits • A robust infrastructure is now in place to effectively manage large volumes of complex sequencing data • The data are being made publicly available for re-use of this expensive resource More Information • http://www.imperial.ac.uk/genomicsfacility/informatics/
  • 11. Integrated Rule-Based Data Management System for Genome Sequencing Data M. Mueller
  • 12. Research Data Management in Computational and Experimental Molecular Science H. S. Rzepa, M. J. Harvey, N. Mason & A. Mclean Idea • Address sustainability and scalability of a hub interfacing electronic lab notebooks with HPC resources and digital data repositories Achievements • Produced an installer package to allow reuse of uportal DSpace front end • Enhanced metadata in local repository to make it compliant with DataCite specifications – all repository content automatically receives a DOI • Integrated ORCID into their solution • Developed a procedure using DOIs for directly retrieving data from a digital repository and displaying it using Javascript components • Curated 170,000 datasets from Cambridge to Imperial, adding standards-based metadata RDM Benefits • Molecular data can be referenced more robustly with persistent identifiers – step forward in data citation More Information • http://doi.org/10042/a3v1w
  • 13. Research Data Management in Computational and Experimental Molecular Science H. S. Rzepa, M. J. Harvey, N. Mason & A. Mclean
  • 14. Research Data Management: Where Software Meets Data G. Gorman, C. T. Jacobs & A. Avdis Idea • Integrating research data management into the research workflow so that data and software can be curated at the push of a button using Figshare and Git Achievements • Developed and released an open source software library: PyRDM • Automatically transfers software source code (stored under Git control) and data to Figshare • Figshare generates a DOI for that code version and the data • Metadata including author details and cross-referencing between code and data are uploaded automatically • Hoping for ORCID authentication via Figshare API to be added • PyRDM was integrated into the Fluidity computational fluid dynamics code • DOIs minted are stored in Fluidity to improve data provenance and allow a new revision of the repository to be created if the data are updated at a later stage RDM Benefits • Research data published in line with funder expectations • The DOI for a specific code version enables better recomputability of data • Automated metadata generation reduces academic burden More Information • http://github.com/pyrdm http://dx.doi.org/10.5334/jors.bj www.fluidity-project.org
  • 15. Research Data Management: Placing [Time Series] Data in its Context N. Jones Idea • Provide a platform and technology which automatically connects researchers through their time-series data, models and analysis methods Achievements • Online interdisciplinary collection of time-series data and time-series analysis code • Functionality to automatically profile time series • Functionality to automatically profile time series algorithms • Functionality to use these profiles to place a user’s work in the context of others RDM Benefits • Incentivises data sharing by allowing data comparison – increases discoverability of an academic’s data plus increases likelihood of finding other relevant data • Resource also available to general public More Information • http://www.comp-engine.org/timeseries/
  • 16. Research Data Management: Placing [Time Series] Data in its Context N. Jones
  • 17. Overall Conclusions Good data curation is HARD and EXPENSIVE Development of sustainable research software is also HARD and EXPENSIVE Data citation is important Immediate incentives help APIs useful preferably open Auto-generation of metadata E lab books seem useful Clinical data is a minefield Nucleus of an RDM community at Imperial Ideas to consider for wider deployment for cross-College benefit
  • 18. Thanks and Questions Review of applications: • Kevin Ashley, DCC Director Green Shoots academics: • M. Bearpark & C. Fare • G. Thomas, S. Butcher & C. Tomlinson • M. Mueller • H. S. Rzepa, M. J. Harvey, N. Mason & A. Mclean • G. Gorman, C. T. Jacobs & A. Avdis • N. Jones Provision of funds: • Imperial Vice-Provost Advisory Group: Research