Basic reading, writing and informatics skills for biomedical research

17 August 2012 Ganesha Associates 1 Basic reading, writing and informatics skills for biomedical research Segment 6. Experimental design



Page 1: Basic reading, writing and informatics skills for biomedical research

Basic reading, writing and informatics skills for biomedical research

Segment 6. Experimental design

Page 2: Basic reading, writing and informatics skills for biomedical research

Introduction

• What is experimental design?
  – Statistics?

• “The organization of an experiment to allow effective testing of the research hypothesis.”

• “The design of any information-gathering exercises where variation is present, whether under the full control of the experimenter or not.”

Page 3: Basic reading, writing and informatics skills for biomedical research

Some myths about experimental design (ED)

• Myth 1
  – “It’s better to spend time collecting data than sitting around thinking about collecting data; just get into the lab/field and start making measurements!”

• Reality
  – No! A well-designed experiment will save you a lot of time by eliminating unnecessary repetition and improving the precision of your measurements.
  – Hint: analyse data as you generate them (quality control, testing assumptions).

Page 4: Basic reading, writing and informatics skills for biomedical research

Some myths about ED

• Myth 2
  – “It doesn’t matter how you collect your data, there will always be a statistical ‘fix’ that will allow you to analyse them”

• Reality
  – No! Common problems are non-independence of data, lack of prior knowledge of the likely variance of parameters being measured, and absence of appropriate control or reference groups.

Page 5: Basic reading, writing and informatics skills for biomedical research

Some myths about ED

• Myth 3
  – “If you collect lots of data something interesting will come out and you will be able to detect even very subtle effects”

• Reality
  – No! Generally, collecting lots of data without a well-formulated hypothesis wastes your time and someone else’s money.
  – Remember that if you analyse a data set in many different ways, you are very likely to find effects with a p-value of less than 0.05 by chance alone.
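The warning about analysing one data set in many different ways can be made concrete with a small calculation. The sketch below is illustrative only: it assumes the tests are independent and that no real effect exists, which real analyses rarely satisfy exactly, and the function names are hypothetical rather than taken from the slides.

```python
def familywise_error_rate(n_tests: int, alpha: float = 0.05) -> float:
    """Chance of at least one p < alpha among n_tests independent tests
    when every null hypothesis is true (an idealised assumption)."""
    return 1.0 - (1.0 - alpha) ** n_tests


def bonferroni_threshold(n_tests: int, alpha: float = 0.05) -> float:
    """Per-test threshold that keeps the overall error rate at or below alpha."""
    return alpha / n_tests


if __name__ == "__main__":
    # Show how quickly chance findings accumulate as more tests are run.
    for n in (1, 5, 20, 100):
        print(n, round(familywise_error_rate(n), 3), bonferroni_threshold(n))
```

Under these assumptions, 20 independent looks at pure noise already give roughly a 64% chance of at least one nominally significant result. The Bonferroni threshold is one simple, conservative correction; other approaches exist.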

Page 6: Basic reading, writing and informatics skills for biomedical research

Start with the project proposal: quick checklist

• Why is the problem under study of importance?
  – Economic, medical significance?
  – What are the underlying key issues of basic scientific significance?
  – Are strong links to the consensus view established?

• How is the problem to be addressed experimentally?
  – Has an appropriate model system been chosen?
  – What information needs to be collected?
  – Which methods have been chosen for this purpose and why?

• Limitations
  – Have the most likely reasons for failure been identified?
  – What is the ‘fail early’ strategy?

• Literature review
  – Is it up to date?
  – Are all key points of logical development in the text backed by an appropriate reference?

Page 7: Basic reading, writing and informatics skills for biomedical research


The anatomy of an experiment


Page 8: Basic reading, writing and informatics skills for biomedical research

The anatomy of an omics experiment

• A sample of cells (≈ observational units), from an organism (≈ experimental units), with a history, on which the measurements are to be compared in one or more possible states, e.g. genotype, disease, chemical treatment (≈ treatments)

• to which a lot of delicate wet-lab preliminaries (extraction, amplification, labelling) are applied

• leading to the analyte being submitted to a complex piece of equipment, on which lots (10s to 1,000,000s) of measurements are made.

Page 9: Basic reading, writing and informatics skills for biomedical research

The anatomy of an omics experiment

• The measurements can be fluorescence intensities or counts, often highly pre-processed, with transformation, normalization and summarization usually being needed before analysis.

• There is rarely a single objective, and frequently no clearly stated goal; often it is a screen (≈ fishing expedition), where a good outcome can be finding one, many, or even all of the genes, proteins or metabolites that satisfy some condition.

Page 10: Basic reading, writing and informatics skills for biomedical research

Key factors to think about

• Baseline assumptions
• Sources of variance
• Need for technical, biological replication
• Sample size, statistical power and significance
• Choice of controls
• Non-independence of data
• Confounding variables
• Randomization, stratification
• Bias, blinding
• Multiple testing, data mining
• Inferring causality
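The link between sample size, statistical power and significance in the list above can be illustrated with a rough calculation. This is a minimal sketch, assuming a two-sided comparison of two group means with equal group sizes, a common standard deviation known in advance, and a normal approximation; the function name and default values are illustrative choices, not part of the original slides.

```python
from math import ceil
from statistics import NormalDist


def sample_size_per_group(effect: float, sd: float,
                          alpha: float = 0.05, power: float = 0.80) -> int:
    """Approximate n per group for a two-sided, two-sample comparison of means.

    Normal approximation:
        n = 2 * ((z_(1 - alpha/2) + z_power) * sd / effect) ** 2
    `effect` is the smallest difference in means worth detecting and
    `sd` is the assumed within-group standard deviation.
    """
    if effect == 0:
        raise ValueError("effect must be non-zero")
    z_alpha = NormalDist().inv_cdf(1.0 - alpha / 2.0)
    z_power = NormalDist().inv_cdf(power)
    n = 2.0 * ((z_alpha + z_power) * sd / effect) ** 2
    return ceil(n)


if __name__ == "__main__":
    # Halving the detectable effect roughly quadruples the required sample.
    print(sample_size_per_group(effect=1.0, sd=1.0))
    print(sample_size_per_group(effect=0.5, sd=1.0))
```

The calculation needs a prior estimate of the variance, which is why a pilot study or published data are useful before the main experiment. A calculation based on the t distribution gives slightly larger numbers than this approximation.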

Page 11: Basic reading, writing and informatics skills for biomedical research


Collecting data – keep a notebook

Page 12: Basic reading, writing and informatics skills for biomedical research


Collecting data – make a spreadsheet

Page 13: Basic reading, writing and informatics skills for biomedical research


Collecting data – check key assumptions


Page 15: Basic reading, writing and informatics skills for biomedical research


Publication doesn’t guarantee that the design is correct!


Page 18: Basic reading, writing and informatics skills for biomedical research

“Why Most Published Research Findings Are False”
John Ioannidis, PLoS Medicine, 2005

• The smaller the studies conducted in a scientific field, the less likely the research findings are to be true.

• The smaller the effect sizes in a scientific field, the less likely the research findings are to be true.

• The greater the number and the lesser the selection of tested relationships in a scientific field, the less likely the research findings are to be true.

• The greater the flexibility in designs, definitions, outcomes, and analytical modes in a scientific field, the less likely the research findings are to be true.

• The hotter a scientific field (with more scientific teams involved), the less likely the research findings are to be true.
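The reasoning behind these points can be sketched with Bayes' rule: the chance that a "significant" finding is true depends on the pre-study odds that a tested relationship is real, on the power of the study, and on the significance threshold. The sketch below is a simplified illustration that ignores bias and multiple competing teams; the function and parameter names are assumptions made for this example.

```python
def positive_predictive_value(prior_odds: float, power: float,
                              alpha: float = 0.05) -> float:
    """Probability that a statistically significant finding is true.

    prior_odds: ratio of true to false relationships among those tested
    power:      probability of detecting a true relationship (1 - beta)
    alpha:      probability of a false positive for a null relationship
    """
    if prior_odds < 0 or not 0 < power <= 1 or not 0 < alpha < 1:
        raise ValueError("check prior_odds >= 0, 0 < power <= 1, 0 < alpha < 1")
    true_positives = power * prior_odds   # per false relationship tested
    false_positives = alpha               # per false relationship tested
    return true_positives / (true_positives + false_positives)


if __name__ == "__main__":
    # A well-powered test of a plausible hypothesis versus an
    # underpowered test of a long shot.
    print(round(positive_predictive_value(prior_odds=1.0, power=0.8), 3))
    print(round(positive_predictive_value(prior_odds=0.1, power=0.2), 3))
```

With even pre-study odds and 80% power, about 94% of significant findings would be true under these assumptions; with odds of 1 in 10 and 20% power, the figure falls to about 29%.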

Page 19: Basic reading, writing and informatics skills for biomedical research

How citation distortions create unfounded authority
Steven Greenberg, BMJ, 2009

Many published biomedical belief systems are built on sound data, with authors repeating claims after trusting the published expert opinion of their colleagues. However, there are incentives for generating and joining information cascades regardless of their soundness. Joining an information cascade aids publication as articles have to say something and negative results are biased against. Generating and joining an information cascade may improve the likelihood of obtaining research funding because hypothesis driven research is an essential requirement at many research funding agencies and successful funding generally requires a “strong hypothesis . . .

Page 20: Basic reading, writing and informatics skills for biomedical research

In life sciences there are many unknowns

“As we know,
There are known knowns.
There are things we know we know.
We also know
There are known unknowns.
That is to say
We know there are some things
We do not know.
But there are also unknown unknowns,
The ones we don't know
We don't know.”

Donald Rumsfeld, US Secretary of Defense (sic)
Feb. 12, 2002, Department of Defense news briefing
from "The Poetry of D.H. Rumsfeld" http://slate.msn.com/id/2081042/

Page 21: Basic reading, writing and informatics skills for biomedical research

Presenting your ideas

• Create a slide show that is an outline, not a script
• Use the slide show...
  – to select important information and visuals
  – to organize content
  – to create a hierarchy
• Many of the subsequent slides were adapted from work done by the Cain Project in Engineering & Professional Communication
• www.owlnet.rice.edu/~cainproj

Page 22: Basic reading, writing and informatics skills for biomedical research

Selecting Content

• Consider your audience: not everyone will have your knowledge of the problem!
• State problem/question clearly, early and repeat (in the title, in the introduction)
• Explain the significance, context
• Include background: organism/system/model
• State the point of departure for work precisely

Page 23: Basic reading, writing and informatics skills for biomedical research

Displaying Text

• Remember that your audience...
  – skims each slide
  – looks for critical points, not details
  – needs help reading/seeing text
  – So keep to an outline only

• Help your audience by…
  – Projecting a clear font
  – Using bullets
  – Using content-specific headings
  – Using short phrases
  – Using grammatical parallelism

Page 24: Basic reading, writing and informatics skills for biomedical research

Project a clear font

• Serif: easy to read in printed documents
  – Times New Roman, Palatino, Garamond

• Sans serif: easy to see projected across the room
  – Arial, Helvetica, Geneva

Page 25: Basic reading, writing and informatics skills for biomedical research

Use bullets, but not too many

• Bullets help your audience
  – to skim the slide
  – to see relationships between information
  – to organize information in a logical way

• For example, this is Main Point 1, which leads to...
  – Sub-point 1
    • Further subordinated point 1
    • Further subordinated point 2
  – Sub-point 2

Page 26: Basic reading, writing and informatics skills for biomedical research


Use content-specific headings

• “Results” suggests the content area for a slide

• “Substance X up-regulates gene Y” (with data shown below) shows the audience what is observed

Page 27: Basic reading, writing and informatics skills for biomedical research

Use short phrases

• Be clear, concise, accurate
• Write complete sentences only in certain cases:
  – Hypothesis / problem statement
  – Quote
  – ???

Difficult to read:
DNA polymerase catalyzes elongation of DNA chains in the 5’ to 3’ direction

Better:
DNA polymerase extends 5’ to 3’

Page 28: Basic reading, writing and informatics skills for biomedical research

Use grammatical parallelism

• Use same grammatical form in lists

• Not parallel:
  – Lyse cells in buffer
  – 5 minute centrifuging of lysate
  – Supernatant is removed

• Parallel:
  – Lysed cells in buffer
  – Centrifuged lysate for 5 minutes
  – Removed supernatant

Page 29: Basic reading, writing and informatics skills for biomedical research

Use grammatical parallelism

How would you revise this list?

Telomeres
• Contain non-coding DNA
• Telomerases can extend telomeres
• Cells enter senescence/apoptosis when telomeres are too short

Page 30: Basic reading, writing and informatics skills for biomedical research

Use grammatical parallelism

One possible revision…

Telomeres
• Contain non-coding DNA
• Are extended by telomerase
• Cause senescence/apoptosis when shortened too much

Page 31: Basic reading, writing and informatics skills for biomedical research

Displaying visuals

• Select visuals that enhance understanding
  – Figures from your work: evidence for argument
  – Figures from other sources (web; review articles):
    • Model a process or concept
    • Help explain background, context
• Design easy-to-read visuals
  – Are the visuals easy to read by all members of your audience?
• Draw attention to aspects of visuals

Page 32: Basic reading, writing and informatics skills for biomedical research


Simplify and draw attention

http://www.indstate.edu/thcme/mwking/tca-cycle.html

Page 33: Basic reading, writing and informatics skills for biomedical research


Cite others’ visuals

http://www.bioc.rice.edu/~shamoo/shamoolab.html

Harvey et al. (2005) Cell 122:407-20

Page 34: Basic reading, writing and informatics skills for biomedical research

Samples

Features to consider:
• Text
  – Fonts, use of phrases, parallelism
• Visuals
  – Readability, drawing attention
• Slide design
• Organization/hierarchy
  – Titles, bullets, arrangement of information, font size


Page 40: Basic reading, writing and informatics skills for biomedical research

The Calcium Ion

Calcium is a crucial cell-signaling molecule

– Calcium is toxic at high intracellular concentrations because of the phosphate-based energy system

– Intracellular concentrations of calcium are kept very low, which allows an influx of calcium to be a signal to alter transcription

Page 41: Basic reading, writing and informatics skills for biomedical research


Microarrays

Phillips G. (2004) Iowa State University College of Veterinary Medicine.

Page 42: Basic reading, writing and informatics skills for biomedical research

Presenting

• Delivery
• Handling questions

Page 43: Basic reading, writing and informatics skills for biomedical research

Delivery

• Physical environment
• Stance
  – Body language
  – Handling notes
• Gestures
• Eye contact
• Voice quality
  – Volume
  – Inflection
  – Pace

Page 44: Basic reading, writing and informatics skills for biomedical research

Handling Questions

• LISTEN
• Repeat or rephrase
• Watch body language
• Don’t pretend to know

Page 45: Basic reading, writing and informatics skills for biomedical research

Practical activity 6a - Developing and presenting your project

• Total duration - ca. 2 hours.
• Identify the five most important research articles that frame your hypothesis, i.e. the fundamental facts and assumptions upon which your idea is based.
• Describe the basis for your hypothesis in a paragraph of no more than seven sentences.
• Read the article by Peter Norvig on experimental design. (For Firefox users the alternative URL is here.)
• What alternative experimental approaches are available to answer your question?
• How do you intend to verify your hypothesis?
• Identify and justify the journal in which you want to publish the results of your research.
• Give a 5-slide presentation to justify your choices at the next session.

Page 46: Basic reading, writing and informatics skills for biomedical research

Practical activity 6b - Thinking about probability and statistics

• Total duration - ca. 3 hours.
• First read the series of articles published recently by Wai-Ching Leung in the British Medical Journal. Although intended for a medical audience, these articles provide the basis for a useful primer for almost all fields of biomedical research. The articles are:
  – Why and when do we need medical statistics
  – Measuring chances
  – Summarising information
  – Testing hypotheses
• Now answer the following questions:
  – I have a plant extract which I believe has an effect on blood pressure. I measure its effects by injecting the substance into rats and measuring their blood pressure before and after the injection. The statistical test I use tells me that the probability of collecting this sample of results is less than 0.05. What does this mean?
  – 1% of women aged forty who participate in routine mammography screening have breast cancer. 80% of the women with breast cancer get a positive result. 9.6% of women without breast cancer will also get a positive result. So, if a woman from this group gets a positive result, what is the probability that she has breast cancer?
  – In the UK, car registration plates can typically consist of a string of 6 or 7 alphanumeric characters (A, B, C, etc., 1, 2, 3, etc.). So the probability of a specific sequence of characters (e.g. DB1979) is less than 1 in 2 billion. I send a small group of people out into a car park and ask them to look for a registration plate that has personal significance for them. What is the likelihood of this happening?
  – A friend of mine has consistently predicted the results of 5 of the football matches leading to today's final. He is offering to sell me his prediction for the final match so that I can place a bet and make some money. What are the odds that he will predict the outcome of the last match correctly?
  – A murder is committed. Traces of your fingerprints are found on the murder weapon. What is the probability that you are guilty?
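Two of the questions above involve arithmetic that is easy to check in a few lines of code. The sketch below is one possible way to test your own reasoning, not an official answer key. It applies Bayes' rule to the screening figures quoted in the question and counts the possible 6-character plates, assuming 36 equally likely characters per position with no format restrictions; the function names are hypothetical.

```python
def posterior_probability(prevalence: float, sensitivity: float,
                          false_positive_rate: float) -> float:
    """P(condition | positive test) from Bayes' rule."""
    true_positives = prevalence * sensitivity
    false_positives = (1.0 - prevalence) * false_positive_rate
    return true_positives / (true_positives + false_positives)


def count_plates(length: int, alphabet_size: int = 36) -> int:
    """Number of distinct strings of the given length (26 letters + 10 digits)."""
    return alphabet_size ** length


if __name__ == "__main__":
    # Screening figures from the question: 1% prevalence, 80% of cases
    # test positive, 9.6% of non-cases also test positive.
    print(round(posterior_probability(0.01, 0.80, 0.096), 3))
    # Number of possible 6-character plates under the stated assumption.
    print(count_plates(6))
```

The first result is far lower than most people guess, because the false positives from the large healthy group outnumber the true positives. The second confirms the "less than 1 in 2 billion" figure for one specific plate, but the car-park question asks about any plate that someone finds meaningful, which is a very different event.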

Page 47: Basic reading, writing and informatics skills for biomedical research

Practical activity 6c - Presenting data

• Total duration - ca. 1 hour.
• Read Mary Purugganan's presentation about data visualisation. Identify some examples of illustrations used in recent primary research papers which illustrate some of the points she makes.