ucla gseis

Download Ucla GSEIS

Post on 19-Jan-2015




1 download

Embed Size (px)




  • 1. the knowledge web13 november 2008 ucla graduate school of education and information sciences

2. knowledge gaps process failurestransaction costs lost opportunities 3. knowledge gaps process failurestransaction costs lost opportunities is the answer more information? 4. many information products advance incrementally 5. the discovery process: 6. the discovery process: 7. thanks to the products, we already know a lot... 8. we need information innovations and process innovationsto match product innovations. 9. 1. the digital commons represents amethodology that lowers the cost and increases the volume of transactions at the knowledge layer of the net 10. does the ability to ask more questions, faster, lead us to moreknowledge or just more data? 11. whats different about communications and computers? 12. 1. we know stuff. 13. 1. we know stuff. 2. open networks. 14. content codephysical 15. content codephysical 16. knowledge contentcode physical 17. knowledge rights 18. knowledge rights 19. the commons 20. digital commons interoperability low transaction costslaw and technology 21. user interface to copyright 22. 140,000,000+ digital objects onlineunder our licenseslicenses ported to 50+ countriesintegrated with Google,Yahoo, Firefox, Microsoft Ofce... 23. 2. the digital commons is a stablemethodology to manage data, materials, and content for science. 24. project development do no harmfunding pro bono running code community think marketearly focus on life sciencesexploring climate change, geospatial, elsewhere 25. what would move via the science network? 26. Open Access Content 27. making knowledge legally and technically available for re-use and composition into new knowledge. 28. we use digital tools to replicate paper technology 29. creative expression 30. ideas or facts 31. e=mc2 32. the container, not the facts. 33. the container, not the facts.but locks the container. 34. IGFBP-5 plays a role in the regulation of cellular senescence via a p53-dependent pathway and in aging-associated vascular diseases 35. IGFBP-5 plays a role in the regulation of cellular senescence via a p53-dependent pathway and in aging-associated vascular diseases 36. indexing: disallowed. http://orpheus-1.ucsd.edu/acq/license/cdlelsevier2004.pdf 37. c >1000 journals under CCimage from the public library of science licensed to the public under CC-BY 3.0 38. PubMedCentral ~ 1,000,000 articles permissions granted: 50,000 (6% of PMC legal for transformative use) (.003 of all PubMed records) 39. what do these ideas mean in a world of integrated data? 40. creativework? 41. So, out of all of thisdiscussion my question is whetherChemSpider is Content or Data. - Antony Williams 42. The motivation behind this memorandum isinteroperability of scientic data. 43. +is it legal? 44. ++ + + is it legal? + + ++ 45. 1 Converge on the public domain by waiving all rights based on intellectual property 2 Converge on the public domain by waiving other statutory or intellectual property rights. 3 Converge on the public domain by imposing no contractual controls. 4 Provide for interoperation with databases not available under the Protocol through open metadata 46. a protocol, not a license. 47. conicts with the protection instinct 48. conicts with the protection instinct the protection instinct is frequently an instinct to protect freedom 49. 3. we have to build infrastructure for data into the web of documents that we have. 50. solves the legal problem 51. but not the container problem. 52. web 2.0, science 3.0, what aboutmaking Google work better? 53. over 200years at one paper/day 54. what you want isa list of genes.not a list of documents. 55. building a web for data: the semantic web 56. making computers understand links between documentslinks to Web pageWeb page 57. making computers understand relationships between concepts causesdrinking coffeefeel awake 58. causes drink coffeefeel awake 59. http://ontology.foo.org/causescauses drinking coffee feel awakehttp://ontology.foo.org/drinking coffeehttp://ontology.foo.org/feel awake h 60. we need a Domain Name System for concepts: http://sciencecommons.orgcoffeehttp://ontology.foo.org/coffee 61. use the web tointegrate information from different placesand different names coffeecafecoffee http://ontology.foo.org/coffee kopi 62. bed person located atget out of bedlast subeventdoes not want wantsget out of bedaftercauses drink coffee feel awakerst subeventsubevent causes feel jitteryopen eyesafter after make coffee pour coffee pick up cupdrinkis ais for located in coffeecafe property ofoften near often near wet cup sugar 63. (too much work for coffee) 64. (distributed, networked approaches start tolook pretty good) 65. Open Source Data Integration 66. formatting digital knowledge intomodular building blocks for composition into new knowledge. 67. e pluribus unum. 68. we can transform complex queries into links prefix go: prefix rdfs: Mesh: Pyramidal Neuronsprefix owl: prefix mesh: prefix sc: prefix ro: select ?genename ?processname where{ graph Pubmed: Journal Articles{ ?paper ?p mesh:D017966 . ?article sc:identified_by_pmid ?paper. ?gene sc:describes_gene_or_gene_product_mentioned_by ?article.}graph Entrez Gene: Genes{ ?protein rdfs:subClassOf ?res.?res owl:onProperty ro:has_function.?res owl:someValuesFrom ?res2.?res2 owl:onProperty ro:realized_as. ?res2 owl:someValuesFrom ?process. graph {{?process go:GO_0007166} union{?process rdfs:subClassOf go:GO_0007166 }}?protein rdfs:subClassOf ?parent. ?parent owl:equivalentClass ?res3. GO: Signal Transduction?res3 owl:hasValue ?gene.} graph { ?gene rdfs:label ?genename }graph { ?process rdfs:label ?processname}} 69. we can transform complex queries into links http://hcls1.csail.mit.edu:8890/sparql/?query=prex%20go%3A%20%3Chttp%3A%2F%2Fpurl.org%2Fobo%2Fowl%2FGO%23%3E%0Aprex%20rdfs%3A %20%3Chttp%3A%2F%2Fwww.w3.org%2F2000%2F01%2Frdf-schema%23%3E%0Aprex%20owl%3A%20%3Chttp%3A%2F%2Fwww.w3.org%2F2002% 2F07%2Fowl%23%3E%0Aprex%20mesh%3A%20%3Chttp%3A%2F%2Fpurl.org%2Fcommons%2Frecord%2Fmesh%2F%3E%0Aprex%20sc%3A%20% 3Chttp%3A%2F%2Fpurl.org%2Fscience%2Fowl%2Fsciencecommons%2F%3E%0Aprex%20ro%3A%20%3Chttp%3A%2F%2Fwww.obofoundry.org%2Fro %2Fro.owl%23%3E%0A%0Aselect%20%3Fgenename%20%3Fprocessname%0Awhere%0A%7B%20%20graph%20%3Chttp%3A%2F%2Fpurl.org% 2Fcommons%2Fhcls%2Fpubmesh%3E%0A%20%20%20%20%20%7B%20%3Fpaper%20%3Fp%20mesh%3AD017966%20.%0A%20%20%20%20%20%20% 20%3Farticle%20sc%3Aidentied_by_pmid%20%3Fpaper.%0A%20%20%20%20%20%20%20%3Fgene%20sc% 3Adescribes_gene_or_gene_product_mentioned_by%20%3Farticle.%0A%20%20%20%20%20%7D%0A%20%20%20graph%20%3Chttp%3A%2F% 2Fpurl.org%2Fcommons%2Fhcls%2Fgoa%3E%0A%20%20%20%20%20%7B%20%3Fprotein%20rdfs%3AsubClassOf%20%3Fres.%0A%20%20%20%20% 20%20%20%3Fres%20owl%3AonProperty%20ro%3Ahas_function.%0A%20%20%20%20%20%20%20%3Fres%20owl%3AsomeValuesFrom%20%3Fres2.% 0A%20%20%20%20%20%20%20%3Fres2%20owl%3AonProperty%20ro%3Arealized_as.%0A%20%20%20%20%20%20%20%3Fres2%20owl% 3AsomeValuesFrom%20%3Fprocess.%0A%20%20%20graph%20%3Chttp%3A%2F%2Fpurl.org%2Fcommons%2Fhcls%2F20070416%2Fclassrelations%3E %0A%20%20%20%20%20%7B%7B%3Fprocess%20%3Chttp%3A%2F%2Fpurl.org%2Fobo%2Fowl%2Fobo%23part_of%3E%20go%3AGO_0007166%7D% 0A%20%20%20%20%20%20%20union%0A%20%20%20%20%20%20%7B%3Fprocess%20rdfs%3AsubClassOf%20go%3AGO_0007166%20%7D%7D%0A %20%20%20%20%20%20%20%3Fprotein%20rdfs%3AsubClassOf%20%3Fparent.%0A%20%20%20%20%20%20%20%3Fparent%20owl% 3AequivalentClass%20%3Fres3.%0A%20%20%20%20%20%20%20%3Fres3%20owl%3AhasValue%20%3Fgene.%0A%20%20%20%20%20%20%7D%0A% 20%20%20graph%20%3Chttp%3A%2F%2Fpurl.org%2Fcommons%2Fhcls%2Fgene%3E%0A%20%20%20%20%20%7B%20%3Fgene%20rdfs%3Alabel%20% 3Fgenename%20%7D%0A%20%20%20graph%20%3Chttp%3A%2F%2Fpurl.org%2Fcommons%2Fhcls%2F20070416%3E%0A%20%20%20%20%20%7B% 20%3Fprocess%20rdfs%3Alabel%20%3Fprocessname%7D%0A%7D&format=&maxrows=50 70. we can transform complex queries into links 71. we can help scholars remix queriesprex go: prex rdfs: prex owl: prex mesh: prex sc: prex ro: select ?genename ?processnamewhere{ graph mesh:D009369 { ?paper ?p ?article sc:identied_by_pmid ?paper.. Mesh: Cancer ?gene sc:describes_gene_or_gene_product_mentioned_by ?article. }graph { ?protein rdfs:subClassOf ?res. ?res owl:onProperty ro:has_function. ?res owl:someValuesFrom ?res2. ?res2 owl:onProperty ro:realized_as. ?res2 owl:someValuesFrom ?process.graph {{?process go:GO_0006610} uniongo:GO_0006610 }}{?process rdfs:subClassOf ?protein rdfs:subClassOf ?parent.GO: Ribosomal Protein ?parent owl:equivalentClass ?res3. ?res3 owl:hasValue ?gene.}graph { ?gene rdfs:label ?genename }graph { ?process rdfs:label ?processname}} 72. we can build a corpus of queries as links 73. we can re-use cultural tools for scholarship 74. we can make science user-driven. 75. 4. institutions have to provide a stable foundation for the knowledge web. 76. process revolutions: the networkHuntingtons Parkinsons ALSMultipleSclerosisAutism 77. institutional revolutions: the network HuntingtonsParkinsonsALS Multiple Sclerosis Autism 78. the library to me: location, structure, discovery, preservation 79. the infrastructure for this is very, very shaky. 80. prex dc: prex skos: prex rdfs: prex owl: prex sc: prex foaf: 81. what are the odds that the organizations making thenamespaces will be here in 50 years? 100 years? 82. Huntingtons 83. Huntingtons Parkinsons ALSMultipleSclerosisAutism 84. Huntingtons Parkinsonslibrary ALSMultipleSclerosisAutism 85. In any case, it is clear that a library containing all possiblebooks, arranged at random, is equivalent (as a source ofinformation) to a library containing zero books.http://en.wikipedia.org/wiki/The_Library_of_Babel 86. exponential content growth 87.


View more >