Page 1
!What Academia Can Learn from Open Source
Creative Commons Attribution 3.0 Unported License
Arfon Smith [email protected] @arfon
"
Page 11
A story from my life (10 years ago)
Page 13
tl;dr - technical, but brimming with inefficiencies
Page 14
http://www.flickr.com/photos/blachswan
Page 15
http://www.flickr.com/photos/esoastronomy/
Page 16
http://www.flickr.com/photos/esoastronomy/http://www.flickr.com/photos/jamiegilbert
Page 17
http://amandabauer.blogspot.com/
Page 20
Diffraction grating
Telescope
Detector
Page 26
130 130 1 2048 189 189 258 258 480 562 378 378 493 521 390 397 851 851 247 274 319 319 304 580 493 511 610 636 188 188 228 228
> cat bad_pix_mask.txt
Page 28
Wasteful
2 days work
Page 29
Wasteful
2 days work 3 observing runs/week
Page 30
Wasteful
2 days work 3 observing runs/week 52 weeks in year
Page 31
Wasteful
2 days work 3 observing runs/week 52 weeks in year 15 year detector lifetime
Page 32
Wasteful
2 days work 3 observing runs/week 52 weeks in year 15 year detector lifetime
2*3*52*15 = 4680 days (13 years)
Page 33
Wasteful… but the norm
2 days work 3 observing runs/week 52 weeks in year 15 year detector lifetime
2*3*52*15 = 4680 days (13 years)
Page 34
A second story from my life (2 months ago)
Page 41
Software composed of many components
Page 42
Your software is the thing that is different
Page 43
Open Source: Ubiquitous culture of reuse
Page 44
Why isn’t academia like this?
Page 47
http://dx.doi.org/10.1051/0004-6361
Page 48
Careers are based on paper counts
Page 49
Careers are based on paper citations
Page 50
Three major problems
Page 51
1. ’Novel’ results preferred
Page 52
2. Reduced collaboration
Page 53
3. The format sucks
Page 55
Explain what you did
Page 56
So that others can repeat
Page 58
It’s the way that we explain that matters most
Page 60
State of the art technology
Page 61
State of the art technology… for the late 17th century*
* Michael Nielsen
Page 63
Data, methods, prose
Page 64
http://www.nature.com/news/2011/111005/full/478026a.html
Page 69
Complex stuff
Numbers, data Science!
Page 70
Re
pro
duc
ibili
ty
Data intensive
Page 71
Verification may take years (if at all)
Page 73
What do open source collaborations do well?
Page 74
Open source collaborations
Open Source vs Open Collaborations
Page 75
Open source collaborations
Open Source: the right to modify, not the right to contribute.
Page 76
Open source collaborations
Open Collaborations: a highly collaborative development process and are receptive to contributions of code, documentation, discussion, etc from anyone who shows competent interest.
Page 77
Open source collaborations
Open Collaborations: a highly collaborative development process and are receptive to contributions of code, documentation, discussion, etc from anyone who shows competent interest.
THIS
Page 78
Ubiquitous culture of reuse
Page 79
Expose their collaborative process
Page 80
How do 4000 people work together?
Page 89
discuss improve
Code first, permission later
Page 90
Every time this happens the community learns
Page 95
Merged pull requests
Page 98
“open source is…reproducible by necessity”
Fernando Perez
http://blog.fperez.org/2013/11/an-ambitious-experiment-in-data-science.html
Page 99
Better at collaborating because they have to be
Page 100
(doesn’t hav
e to mean th
is)Open Public?=
Page 101
‘Open Source’ way of working
Page 102
Open (within your team, department or institution)
Page 103
Electronic & Available
Page 104
Asynchronous, exposed process
Page 106
Low friction collaboration
Page 107
Academia can learn from open source
Page 108
Academia must learn from open source
Page 110
What’s happening in academia today?
Page 111
Collaboration around code
Page 117
Collaborative authoring
Page 120
Collaborative teaching
Page 124
Where might more significant change happen?
Page 125
Where do communities form?
Page 126
Around a shared challenge?
Page 127
Around shared data?
Page 129
10 ?nLevel 1 (continual)
Level 2 (periodic)
Page 130
Supernovae Weak lensing
Active Galactic NucleiSolar System
Galaxies
Transients/variable stars
Large-scale structure
Stars, Milky WayStrong lensing
Informatics and Statistics
Dark Energy (DESC)
Page 132
Software composed of many components
Page 133
Your software should be the thing that is different
Page 134
science too!Your software should be the thing that is different
Page 135
Scientific data is becoming more open
Page 136
http://www.nature.com/news/2011/111005/full/478026a.html
Page 137
How do we make this behaviour the norm?
Page 139
“Academic environments of today do not reward tool builders”
Ed Lazowska, OSTP event
http://lazowska.cs.washington.edu/MS/MS.OSTP.pdf
Page 148
“publishing a paper about code is basically just advertising”
David Donoho
http://www.stanford.edu/~vcs/Video.html
Page 150
How to derive meaningful metrics from open contributions?
Page 160
Barriers are cultural, not technical
Page 161
Why should we care?
Page 162
Because we paid for it?
Page 163
Because open=good?
Page 164
Because care about the creation of knowledge?
Page 165
Open source has solved much of what academia needs
Page 166
Our challenge is to adapt and evolve the academy in this new collaborative age
Page 167
Thanks
[email protected] @arfon
"