class: center, middle, inverse, title-slide # MetaData can be easy ### Rainer M. Krug & Owen L. Petchey ### Department of Evolutionary Biology and Environmental Studies, UZH ### 2019/09/11 (updated: 2019-09-09) --- class: bottom, left background-image: url(./fig/5141328136_16d8f73ee7_b_BW.jpg) background-size: cover .dt[ Image credit: [randylane](http://randylane.com/wp-content/uploads/2014/09/5141328136_16d8f73ee7_b.jpg) ] <style type="text/css"> /* custom.css */ .title-slide { # background-color: coral; background-image: url("fig/uzh_logo_e_pos.svg"), url("fig/qr_talk.svg"), url("https://zenodo.org/badge/DOI/10.5281/zenodo.3386062.svg"); background-size: 20%, 10%, 20%; background-position: 1% 1%, 99% 1%, 1% 99%; } .alert { color: #FF0000; text-align: center; font-size: xx-large; float: middle; } .red { color: #FF0000; } .blue { color: #0000FF; } .dt { font-size: x-small; position: absolute; bottom: 10px; width: 50%; }] </style> --- class: bottom, left background-image: url(./fig/FAIR_EN.png) background-size: contain .dt[ Image credit: [ogsl](https://ogsl.ca/sites/default/files/FAIR_EN.png) ] see [Force11](https://www.force11.org/group/fairgroup/fairprinciples) for more info ??? TO BE FINDABLE: F1. (meta)data are assigned a globally unique and eternally persistent identifier. **F2. data are described with rich metadata.** F3. (meta)data are registered or indexed in a searchable resource. F4. metadata specify the data identifier. TO BE ACCESSIBLE: A1 (meta)data are retrievable by their identifier using a standardized communications protocol. A1.1 the protocol is open, free, and universally implementable. A1.2 the protocol allows for an authentication and authorization procedure, where necessary. **A2 metadata are accessible, even when the data are no longer available.** TO BE INTEROPERABLE: I1. (meta)data use a formal, accessible, shared, and broadly applicable language for knowledge representation. **I2. (meta)data use vocabularies that follow FAIR principles.** I3. (meta)data include qualified references to other (meta)data. TO BE RE-USABLE: **R1. meta(data) have a plurality of accurate and relevant attributes.** R1.1. (meta)data are released with a clear and accessible data usage license. R1.2. (meta)data are associated with their provenance. R1.3. (meta)data meet domain-relevant community standards. --- class: center, top # Why do we need this? -- .left[ - Whose MSc / PhD / first research project was more than 3 years ago? ] -- .left[ - Who remembers - where the data is and if it is still readable? - what the column names mean? - which units were used? - how the analysis was done (script)? ] -- # .red[How can you re-use your data?] -- # .red[How can anybody else re-use your data?] --- class: bottom, left background-image: url(./fig/5141328136_16d8f73ee7_b_BW.jpg) background-size: cover .dt[ Image credit: [randylane](http://randylane.com/wp-content/uploads/2014/09/5141328136_16d8f73ee7_b.jpg) ] --- class: bottom, left background-image: url(./fig/5141328136_16d8f73ee7_b.jpg) background-size: cover .dt[ Image credit: [randylane](http://randylane.com/wp-content/uploads/2014/09/5141328136_16d8f73ee7_b.jpg) ] --- class: bottom, left background-image: url(./fig/Person-tree.jpg) background-size: contain .dt[ Image credit: [Pooktree](https://en.wikipedia.org/wiki/Tree#/media/File:Person-tree.jpg) ] --- class: bottom, left background-image: url(./fig/birch.jpg) background-size: contain .dt[ Image credit: [Artplants](https://www.amazon.co.uk/Birch-Foliage-Spray-leaves-green/dp/B00FMQR884) ] --- class: bottom, left background-image: url(./fig/Poisot2019.png) background-size: contain [Poisot et al. (2019)](https://doi.org/10.1016/j.tree.2019.04.005) --- class: top background-image: url(./fig/birch.jpg) background-size: 60% background-position: 112% 50% .dt[ Image credit: [Pooktree](https://en.wikipedia.org/wiki/Tree#/media/File:Person-tree.jpg) ] ## Two aspects -- 1. .red[**Developing**] a new Domain Specific MetaData Scheme -- 2. .red[**Using**] the new Domain Specific MetaData Scheme --- ## Tool(s) which will help to ??? What do we know: Spreadsheets, Web Browser & R -- #### develop a new Domain Specific MetaData Scheme [.blue[ developer & scientists]: .red[R & Spreadsheet]] - together with scientists - encourage buy in & "ownership" of scheme ??? - As simple as possible, but as complex as necessary - Spreadsheet to define scheme -- #### enter MetaData [.blue[ user]: .red[browser & Spreadsheet]] - easy, fun & transparent ??? - Spreadsheet to enter data -- #### validate MetaData [.blue[ user]: .red[browser]] - report of errors and inconsistencies ??? - Validation report as html or pdf -- #### export MetaData [.blue[ user]: .red[browser]] - formats suitable for repository submission & indexing ??? - at the moment only .xml --- class: top background-image: url(./fig/Rlogo.png) background-size: 20% background-position: top right ## Use programming language R -- ## Two packages -- 1. **scheme generator - developer - `dmdScheme`** - template scheme, - base functionality for - entering, - validating and - exporting the metadata and - tools to develop new schemes and packages -- 2. **actual scheme - user - e.g. `emeScheme`** - a specific scheme and can contain - additional functionality for - entering, - validating and - exporting the metadata and - additional specific tools ??? Let's look at the actual spreaadsheet --- class: center, top background-image: url(./fig/Spreadsheet_Experiment.png) background-size: 107% background-position: middle center ??? - Experiment based - this one is needed --- class: center, top background-image: url(./fig/Spreadsheet_Species.png) background-size: 107% background-position: middle center ??? - sheet can be modified - sheets can be removed or added --- class: center, top background-image: url(./fig/Spreadsheet_DataFileMetaData.png) background-size: 107% background-position: middle center ??? - link to data files - sheet is needed --- class: center, top background-image: url(./fig/ValidationReport.png) background-size: 105% background-position: middle center --- class: inverse, center, middle # How to develop <br/> a new Domain Specific MetaData Scheme (scheme) developer perspective (basic) R knowledge necessary --- class: center, top background-image: url(./fig/workflow_newScheme.png) background-size: contain background-position: top center --- class: inverse, center, middle ## Live Demo How to Develop a New Domain Specific MetaData Scheme --- class: inverse, center, middle # How to use <br/> the new Domain Specific MetaData Scheme from a user / researcher perspective no R knowledge necessary basic GUI (shiny app) is available also possible from R --- class: center, top background-image: url(./fig/ShinyApp.png) background-size: 43% background-position: 50% 25% --- class: center, top background-image: url(./fig/workflow_enterMetaData.png) background-size: contain background-position: top center --- class: inverse, center, middle # Live Demo How to Enter New MetaData --- class: center, top background-image: url(./fig/qr_email.svg) background-size: 10% background-position: 80% 100% # Where to go from Here? -- ### Spread the Word and Develop `dmdSchemes` -- ### Search Engine for all `dmdScheme` ### Web Based Repository of `dmdScheme` Definitions ### Integrate in Existing Repositories -- ## Are you interested in participating? email me: ## Rainer.Krug@uzh.ch --- class: center, top background-image: url(./fig/Group_2019.jpg) background-size: cover background-position: bßottom center A .red[BIG THANK YOU] to everybody in the our group -- Particularly the Guinea Pigs --- class: center, bottom background-image: url(./fig/qr_talk.svg),url(./fig/qr_dmdScheme.svg),url(./fig/big_coffee.jpg),url("https://zenodo.org/badge/DOI/10.5281/zenodo.3386062.svg"),url("https://zenodo.org/badge/DOI/10.5281/zenodo.3229615.svg") background-size: 10%, 10%, 80%, 20%, 20% background-position: 25% 98%, 89% 98%, 50% 0%, 1% 95%, 65% 95% # Talk <span style="padding-left:350px"> `dmdScheme` <span style="padding-left:210px"> .dt[ Image credit: [eater.com](https://www.eater.com/2014/10/30/7131255/starbucks-free-drink-record-83-101-shots-espresso) ]