summary topic (v) software & tools€¦ · claude poirier, matthias templ summary topic (v)...

12
Summary Topic (v) Software & Tools C. Poirier 1 M. Templ 2,3,4 1 2 STATISTIK AUSTRIA Die Infor mationsmanager 3 4 UNECE Worksession on Editing, Oslo, 25/09/12 Claude Poirier, Matthias Templ Summary Topic (v) Oslo, 25/09/12 1 / 10

Upload: others

Post on 25-Sep-2020

2 views

Category:

Documents


0 download

TRANSCRIPT

Page 1: Summary Topic (v) Software & Tools€¦ · Claude Poirier, Matthias Templ Summary Topic (v) Oslo, 25/09/12 7 / 10. deducorrect and editrules Matthias editrules and deducorrect editrules

Summary Topic (v)Software & Tools

C. Poirier1 M. Templ2,3,4

1

2S T A T I S T I K A U S T R I A

D i e I n f o r m a t i o n s m a n a g e r

3

4

OGdata-analysis

UNECE Worksession on Editing, Oslo, 25/09/12Claude Poirier, Matthias Templ Summary Topic (v) Oslo, 25/09/12 1 / 10

Page 2: Summary Topic (v) Software & Tools€¦ · Claude Poirier, Matthias Templ Summary Topic (v) Oslo, 25/09/12 7 / 10. deducorrect and editrules Matthias editrules and deducorrect editrules

R Matthias

R

We saw that the popularity of R increases also in OfficialStatistics

New Developments come almost exquisitly within R

Exchange of scripts and code is easy

R as mediator [Todorov and Templ, 2012, Templ and Todorov, 2012]

Note: http://cran.r-project.org/web/views/OfficialStatistics.html

Claude Poirier, Matthias Templ Summary Topic (v) Oslo, 25/09/12 2 / 10

Page 3: Summary Topic (v) Software & Tools€¦ · Claude Poirier, Matthias Templ Summary Topic (v) Oslo, 25/09/12 7 / 10. deducorrect and editrules Matthias editrules and deducorrect editrules

R Matthias

R

We saw that the popularity of R increases also in OfficialStatistics

New Developments come almost exquisitly within R

Exchange of scripts and code is easy

R as mediator [Todorov and Templ, 2012, Templ and Todorov, 2012]

Note: http://cran.r-project.org/web/views/OfficialStatistics.html

Claude Poirier, Matthias Templ Summary Topic (v) Oslo, 25/09/12 2 / 10

Page 4: Summary Topic (v) Software & Tools€¦ · Claude Poirier, Matthias Templ Summary Topic (v) Oslo, 25/09/12 7 / 10. deducorrect and editrules Matthias editrules and deducorrect editrules

R Claude

General Questions

Prototype stage

- Do you keep realistics objectives? Avoid open scope?

Production stage

- Do you keep up with maintenance/research activities?

Foundation

- or SAS: Do you have good reasons to use one, not the other?

Claude Poirier, Matthias Templ Summary Topic (v) Oslo, 25/09/12 3 / 10

Page 5: Summary Topic (v) Software & Tools€¦ · Claude Poirier, Matthias Templ Summary Topic (v) Oslo, 25/09/12 7 / 10. deducorrect and editrules Matthias editrules and deducorrect editrules

TEA Matthias

TEA for editing and imputation

Use of SQLite, R and C. Really a need to use SQLite when having apowerful server or desctop computer?

Use of multiple imputation methods - do you call the original code ofIVEWARE within R? Better to use irmi? [Templ et al., 2011]

How about the generation of implicite edits from explicite edits. Isthat considered? Which heuristic is used?

Is TEA already used in the production process?

Will the package be available on CRAN in future?

Claude Poirier, Matthias Templ Summary Topic (v) Oslo, 25/09/12 4 / 10

Page 6: Summary Topic (v) Software & Tools€¦ · Claude Poirier, Matthias Templ Summary Topic (v) Oslo, 25/09/12 7 / 10. deducorrect and editrules Matthias editrules and deducorrect editrules

Treemaps and Tab(le)plots Matthias

Treemaps and Tableplots for visual editing

Visual tools are very efficient to analyse your data - also for editing.

treemap only for dissemination of graphics? Interactivity possible, i.e.to get some meta-data when clicking on a region?

tableplot gives a great overview of the data matrix.

possible improvement: if used in editing it indicates which bin is an“outlier”, but there seems to be no way to interactively show thecorresponding observations in the bin (relation of graphic toindividuals is lost). Better to use GAUGIN for that purpose?

why to calculate the bins only with arithmetic means?

Claude Poirier, Matthias Templ Summary Topic (v) Oslo, 25/09/12 5 / 10

Page 7: Summary Topic (v) Software & Tools€¦ · Claude Poirier, Matthias Templ Summary Topic (v) Oslo, 25/09/12 7 / 10. deducorrect and editrules Matthias editrules and deducorrect editrules

R X12 Matthias

R - X12

for batch processing of X12-ARIMA and letting the output and parameterchoice be better processed and accessible.

Many advantages in the production stage over pure X12-ARIMA

Is there an ongoing cooperation with the CENSUS Bureau?

Is the package already applied in production?

What are the differences (advantages/disadvantage) to Demetra?

Claude Poirier, Matthias Templ Summary Topic (v) Oslo, 25/09/12 6 / 10

Page 8: Summary Topic (v) Software & Tools€¦ · Claude Poirier, Matthias Templ Summary Topic (v) Oslo, 25/09/12 7 / 10. deducorrect and editrules Matthias editrules and deducorrect editrules

Screening UNIDO Matthias

Screening in UNIDO

model-based outlier detection of short time series

pilot study if robust methods are useful

many problems occurs in the data used: missing values and shorttime series. Is it really useful to automatically check the series usingcomplex models?

Claude Poirier, Matthias Templ Summary Topic (v) Oslo, 25/09/12 7 / 10

Page 9: Summary Topic (v) Software & Tools€¦ · Claude Poirier, Matthias Templ Summary Topic (v) Oslo, 25/09/12 7 / 10. deducorrect and editrules Matthias editrules and deducorrect editrules

deducorrect and editrules Matthias

editrules and deducorrect

editrules and deducorrect looks promising.

to apply it in production, many user may need a graphical userinterface. Is the development of a point and click interface in plan?

are implicite edits generated from the explicit ones?

which heuristic is used for the generation of implicite edits? (theproblem is NP-hard)

Claude Poirier, Matthias Templ Summary Topic (v) Oslo, 25/09/12 8 / 10

Page 10: Summary Topic (v) Software & Tools€¦ · Claude Poirier, Matthias Templ Summary Topic (v) Oslo, 25/09/12 7 / 10. deducorrect and editrules Matthias editrules and deducorrect editrules

SAS Macros, AMELIA Claude, Matthias

SAS Macros, AMELIA

SAS Macros:

Are the SAS macros available for other offices?

AMELIA2 application:

Single imputation using expected values versus single imputationusing stochastic noise.

Also of interest to compare the results from mice [van Buuren andGroothuis-Oudshoorn, 2011], irmi [Templ et al., 2011] and mi[Yu-Sung et al.].

Claude Poirier, Matthias Templ Summary Topic (v) Oslo, 25/09/12 9 / 10

Page 11: Summary Topic (v) Software & Tools€¦ · Claude Poirier, Matthias Templ Summary Topic (v) Oslo, 25/09/12 7 / 10. deducorrect and editrules Matthias editrules and deducorrect editrules

SAS Macros, AMELIA Claude, Matthias

SAS Macros, AMELIA

SAS Macros:

Are the SAS macros available for other offices?

AMELIA2 application:

Single imputation using expected values versus single imputationusing stochastic noise.

Also of interest to compare the results from mice [van Buuren andGroothuis-Oudshoorn, 2011], irmi [Templ et al., 2011] and mi[Yu-Sung et al.].

Claude Poirier, Matthias Templ Summary Topic (v) Oslo, 25/09/12 9 / 10

Page 12: Summary Topic (v) Software & Tools€¦ · Claude Poirier, Matthias Templ Summary Topic (v) Oslo, 25/09/12 7 / 10. deducorrect and editrules Matthias editrules and deducorrect editrules

References

References

M. Templ and V. Todorov. Official statistics and survey methodologymeets R. The Survey Statistician, 66:26–34, 2012.

M. Templ, A. Kowarik, and P. Filzmoser. Iterative stepwise regressionimputation using standard and robust methods. Comput Stat DataAnal, 55(10):2793–2806, 2011.

V. Todorov and M. Templ. R in the statistical office: Part 2. Workingpaper, United Nations Industrial Development, 2012.

S. van Buuren and K. Groothuis-Oudshoorn. mice: Multivariateimputation by chained equations in R. Journal of Statistical Software,45(3):1–67, 2011. URL http://www.jstatsoft.org/v45/i03/.

S. Yu-Sung, A. Gelman, J. Hill, and M. Yajima. Multiple imputation withdiagnostics (mi) in R: Opening windows into the black box. Journal ofStatistical Software, 45.

Claude Poirier, Matthias Templ Summary Topic (v) Oslo, 25/09/12 10 / 10