class: center, middle, inverse, title-slide

# From Zero to Hero
## Metadata can be Easy!
### Rainer M. Krug
### Department of Evolutionary Biology and Environmental Studies, UZH
### 2019/10/24 (updated: 2019-10-24)

---
class: bottom, left
background-image: url(./fig/metadata.png)
background-size: cover

.dt[ Image credit: [dataedo](https://dataedo.com/kb/data-glossary/what-is-metadata) ]

<style type="text/css">
/* custom.css */
.title-slide {
  /* background-color: coral; */
  background-image: url("fig/uzh_logo_e_pos.svg"), url("fig/qr_talk.svg"), url("https://zenodo.org/badge/DOI/10.5281/zenodo.3516850.svg");
  background-size: 20%, 10%, 20%;
  background-position: 1% 1%, 99% 1%, 1% 99%;
}
.alert {
  color: #FF0000;
  text-align: center;
  font-size: xx-large;
  float: middle;
}
.red {
  color: #FF0000;
}
.blue {
  color: #0000FF;
}
.dt {
  font-size: x-small;
  position: absolute;
  bottom: 10px;
  width: 50%;
}
</style>

---
background-image: url(./fig/5141328136_16d8f73ee7_b_BW.jpg)
background-size: cover

.dt[ Image credit: [randylane](http://randylane.com/wp-content/uploads/2014/09/5141328136_16d8f73ee7_b.jpg) ]

---
class: bottom, left
background-image: url(./fig/IceAge.jpg)
background-size: cover

.dt[ Image credit: [Ice Age](https://i.ytimg.com/vi/YsUyzkjOY_c/maxresdefault.jpg) ]

---
class: bottom, left
background-image: url(./fig/FAIR_EN.png)
background-size: contain

.dt[ Image credit: [ogsl](https://ogsl.ca/sites/default/files/FAIR_EN.png) ]

see [Force11](https://www.force11.org/group/fairgroup/fairprinciples) for more info

???

TO BE FINDABLE:

F1. (meta)data are assigned a globally unique and eternally persistent identifier.

**F2. data are described with rich metadata.**

F3. (meta)data are registered or indexed in a searchable resource.

F4. metadata specify the data identifier.

TO BE ACCESSIBLE:

A1 (meta)data are retrievable by their identifier using a standardized communications protocol.

A1.1 the protocol is open, free, and universally implementable.

A1.2 the protocol allows for an authentication and authorization procedure, where necessary.

**A2 metadata are accessible, even when the data are no longer available.**

TO BE INTEROPERABLE:

I1. (meta)data use a formal, accessible, shared, and broadly applicable language for knowledge representation.

**I2. (meta)data use vocabularies that follow FAIR principles.**

I3. (meta)data include qualified references to other (meta)data.

TO BE RE-USABLE:

**R1. meta(data) have a plurality of accurate and relevant attributes.**

R1.1. (meta)data are released with a clear and accessible data usage license.

R1.2. (meta)data are associated with their provenance.

R1.3. (meta)data meet domain-relevant community standards.

---
class: center, top

# Why do we need this?

--

.left[
- Whose MSc / PhD / first research project was more than 3 years ago?
]

--

.left[
- Who remembers
  - where the data is and if it is still readable?
  - what the column names mean?
  - which units were used?
  - how the analysis was done (script)?
]

--

# .red[How can you re-use your data?]

--

# .red[How can anybody else re-use your data?]
---
class: bottom, left
background-image: url(./fig/5141328136_16d8f73ee7_b_BW.jpg)
background-size: cover

.dt[ Image credit: [randylane](http://randylane.com/wp-content/uploads/2014/09/5141328136_16d8f73ee7_b.jpg) ]

---
class: bottom, left
background-image: url(./fig/pile-of-paper.png)
background-size: cover

.dt[ Image credit: [lunchticket](https://lunchticket.org/importance-following-submission-guidelines/pile-of-paper1/) ]

---
class: bottom, left
background-image: url(./fig/screaming.jpg)
background-size: cover

.dt[ Image credit: [istockphoto](https://media.istockphoto.com/photos/woman-scientist-screaming-picture-id616093188?k=6&m=616093188&s=612x612&w=0&h=1nrZy_8hdnd0BxG1_q60kBbFYmCrrPkyDe_LvKurKjU=) ]

---
class: bottom, left
background-image: url(./fig/happy-scientists.jpg)
background-size: cover

.dt[ Image credit: [edgeforscholars](https://edgeforscholars.org/wp-content/uploads/2017/04/happy-scientists.jpg) ]

---
class: bottom, left
background-image: url(./fig/5141328136_16d8f73ee7_b_BW.jpg)
background-size: cover

.dt[ Image credit: [randylane](http://randylane.com/wp-content/uploads/2014/09/5141328136_16d8f73ee7_b.jpg) ]

---
class: bottom, left
background-image: url(./fig/5141328136_16d8f73ee7_b.jpg)
background-size: cover

.dt[ Image credit: [randylane](http://randylane.com/wp-content/uploads/2014/09/5141328136_16d8f73ee7_b.jpg) ]

---
class: bottom, left
background-image: url(./fig/Person-tree.jpg)
background-size: contain

.dt[ Image credit: [Pooktree](https://en.wikipedia.org/wiki/Tree#/media/File:Person-tree.jpg) ]

???

* MetaData Scheme
* Ecological Metadata Language
* Darwin Core

---
class: bottom, left
background-image: url(./fig/Tree-Pruning-diagram.jpg)
background-size: contain

.dt[ Image credit: [quickcrop](https://www.quickcrop.ie/blog/2018/03/pruning-apple-trees-2/) ]

---
class: bottom, left
background-image: url(./fig/birch.jpg)
background-size: contain

.dt[ Image credit: [Artplants](https://www.amazon.co.uk/Birch-Foliage-Spray-leaves-green/dp/B00FMQR884) ]

???

* Domain Specific Metadata Schemes

---
class: bottom, left
background-image: url(./fig/Poisot2019.png)
background-size: contain

[Poisot et al. (2019)](https://doi.org/10.1016/j.tree.2019.04.005)

---
class: top
background-image: url(./fig/birch.jpg)
background-size: 60%
background-position: 112% 50%

.dt[ Image credit: [Pooktree](https://en.wikipedia.org/wiki/Tree#/media/File:Person-tree.jpg) ]

## Two aspects

--

1. .red[**Developing**] a new Domain Specific MetaData Scheme

--

2. .red[**Using**] the new Domain Specific MetaData Scheme

???

We needed a scheme to describe our data

---

## Tool(s) which will help

--

#### [.blue[ developer & scientists]: .red[tools they know]]

???

What do we know: Spreadsheets, Web Browser & R

--

#### develop a new Domain Specific MetaData Scheme [.blue[ developer & scientists]: .red[R & Spreadsheet]]

- together with scientists
- encourage buy in & "ownership" of scheme

???

- As simple as possible, but as complex as necessary
- Spreadsheet to define scheme

--

#### enter MetaData [.blue[ user]: .red[browser & Spreadsheet]]

- easy, fun & transparent

???

- Spreadsheet to enter data

--

#### validate MetaData [.blue[ user]: .red[browser]]

- report of errors and inconsistencies

???

- Validation report as html or pdf

--

#### export MetaData [.blue[ user]: .red[browser]]

- formats suitable for repository submission & indexing

???

- at the moment only .xml

---
background-image: url(./fig/Rlogo.png)
background-size: 50%
background-position: middle

---
class: top
background-image: url(./fig/Rlogo.png),url(./fig/R-kitten.jpg)
background-size: 20%, 100%
background-position: 100% 0%, 50% 50%

.dt[ Image credit: [youtube](https://i.ytimg.com/vi/Uzd9j2r7Hm8/maxresdefault.jpg) ]

---
class: top
background-image: url(./fig/Rlogo.png),url(./fig/two-kittens.jpg)
background-size: 20%, 100%
background-position: 100% 0%, 50% 50%

.dt[ Image credit: [catsrusrescue](https://catsrusrescue.org/our-shelters-are-always-open-for-business/) ]

---
class: top
background-image: url(./fig/Rlogo.png),url(./fig/two-kittens.jpg)
background-size: 20%, 80%
background-position: 100% 0%, 120% 100%

.dt[ Image credit: [catsrusrescue](https://catsrusrescue.org/our-shelters-are-always-open-for-business/) ]

## Two R packages

--

1. **scheme generator - developer - `dmdScheme`**
    - template scheme,
    - base functionality for
        - entering,
        - validating and
        - exporting the metadata and
    - tools to develop new schemes and packages

--

2. **actual scheme - user - e.g. `emeScheme`**
    - a specific scheme and can contain
    - additional functionality for
        - entering,
        - validating and
        - exporting the metadata and
    - additional specific tools

???

Let's look at the actual spreadsheet

---
class: inverse, center, top

# How to use <br/> a Domain Specific MetaData Scheme

from a user / researcher perspective

no R knowledge necessary

basic GUI (shiny app) is available

also possible from R

---
class: inverse, center, top
background-image: url(./fig/sceptical-dog-cropped.jpg), url(./fig/sceptical-cat.jpg)
background-size: 37%, 37%
background-position: 0% 100%, 100% 100%

.dt[ Image credit: [imgflip](https://imgflip.com/memetemplate/128802203/Sceptical-Cat) [fsmedia](https://fsmedia.imgix.net/bb/ae/c8/cf/ac4d/4437/817c/82a78283e799/the-popularity-of-cbd-products-isnt-validated-by-science----yet.jpeg) ]

# How to use <br/> a Domain Specific MetaData Scheme

from a user / researcher perspective

no R knowledge necessary

basic GUI (shiny app) is available

also possible from R

---
class: center, top
background-image: url(./fig/ShinyApp-0.png)
background-size: 43%
background-position: 50% 25%

---
class: center, top
background-image: url(./fig/ShinyApp-1.png)
background-size: 43%
background-position: 50% 25%

---
class: center, top
background-image: url(./fig/ShinyApp-2.png)
background-size: 43%
background-position: 50% 25%

---
class: center, top
background-image: url(./fig/Spreadsheet_Experiment.png)
background-size: 107%
background-position: middle center

???

- Experiment based
- bibliometric & content metadata

---
class: center, top
background-image: url(./fig/Spreadsheet_Species.png)
background-size: 107%
background-position: middle center

???

- content metadata
- structural metadata

---
class: center, top
background-image: url(./fig/Spreadsheet_DataFileMetaData.png)
background-size: 107%
background-position: middle center

???

- link to data files
- structural metadata

---
class: center, top
background-image: url(./fig/ShinyApp-3.png)
background-size: 43%
background-position: 50% 25%

---
class: center, top
background-image: url(./fig/ShinyApp-4.png)
background-size: 43%
background-position: 50% 25%

---
class: center, top
background-image: url(./fig/ValidationReport.png)
background-size: 105%
background-position: middle center

---
class: center, top
background-image: url(./fig/ShinyApp-5.png)
background-size: 43%
background-position: 50% 25%

---
class: center, top
background-image: url(./fig/xml.png), url(./fig/xml.png)
background-size: 50%, 50%
background-position: 0% 0%, 100% 100%

---
class: center, top
background-image: url(./fig/ShinyApp-0.png)
background-size: 43%
background-position: 50% 25%

---
class: center, top
background-image: url(./fig/happy-dog-cropped.jpg), url(./fig/happy-cat.jpg)
background-size: 53%, 50%
background-position: 0% 100%, 100% 100%

---
class: center, top
background-image: url(./fig/workflow_enterMetaData.png)
background-size: contain
background-position: top center

---
class: inverse, center, top
background-image: url(./fig/frightened-dog.jpg), url(./fig/frightened-cat.jpg)
background-size: 37%, 37%
background-position: 0% 100%, 100% 100%

.dt[ Image credit: [the conversation](https://theconversation.com/cat-muzzles-cruel-or-useful-118885) [the conversation](https://images.theconversation.com/files/143301/original/image-20161026-11265-1xnf3ye.jpg?ixlib=rb-1.1.0&q=45&auto=format&w=926&fit=clip) ]

---
class: inverse, center, top
background-image: url(./fig/frightened-dog.jpg), url(./fig/frightened-cat.jpg)
background-size: 37%, 37%
background-position: 0% 100%, 100% 100%

.dt[ Image credit: [the conversation](https://theconversation.com/cat-muzzles-cruel-or-useful-118885) [the conversation](https://images.theconversation.com/files/143301/original/image-20161026-11265-1xnf3ye.jpg?ixlib=rb-1.1.0&q=45&auto=format&w=926&fit=clip) ]

# How to develop <br/> a new Domain Specific MetaData Scheme

(scheme) developer perspective

(basic) R knowledge necessary

---
class: center, top
background-image: url(./fig/terrified-cat.jpg)
background-size: contain
background-position: top center

.dt[ Image credit: [ask ideas](https://www.askideas.com/31-very-funny-scared-images-and-photos/)]

---
class: center, top
background-image: url(./fig/workflow_newScheme.png)
background-size: contain
background-position: top center

---
class: center, middle, inverse

# Where to go from Here?

---
class: center, top
background-image: url(./fig/future-cat.gif)
background-size: 100%
background-position: 100% 100%

.dt[ Image credit: [gifer](https://gifer.com/en/80Y2)]

---
class: right, top
background-image: url(./fig/future-cat.png)
background-size: 50%
background-position: 0% 100%

# Where to go from Here?

--

### Spread the Word

### Develop `dmdSchemes`

--

### Search Engine for all `dmdScheme`

### Web Based Repository of schemes

### Integrate in Existing Repositories

---
class: center, top
background-image: url(./fig/Group_2019.jpg)
background-size: cover
background-position: bottom center

A .red[BIG THANK YOU] to everybody in our group

--

Particularly the Guinea Pigs

---
class: center, bottom
background-image: url(./fig/qr_talk_github.svg),url(./fig/qr_dmdScheme.svg),url(./fig/big_coffee.jpg),url("https://zenodo.org/badge/DOI/10.5281/zenodo.3516850.svg"),url("https://zenodo.org/badge/DOI/10.5281/zenodo.3229615.svg")
background-size: 10%, 10%, 80%, 20%, 20%
background-position: 25% 98%, 89% 98%, 50% 0%, 1% 95%, 65% 95%

# Talk <span style="padding-left:350px"> `dmdScheme` <span style="padding-left:210px">

.dt[ Image credit: [eater.com](https://www.eater.com/2014/10/30/7131255/starbucks-free-drink-record-83-101-shots-espresso) ]

---
class: center, middle
background-image: url(./fig/hercules.jpg)

[Let's sing together ](https://www.youtube.com/watch?v=yOL-EJZjmp0)