ddod for foia organizations

17
Demand-Driven Open Data More info: Contact http://ddod.us [email protected] | tw: @DPortnoy DDOD for FOIA Organizations (Freedom of Information Act) Originally presented in March 2015 to FOIA directors meeting for all agencies under HHS Concepts explored here do not necessarily represent the views of HHS. 9/2015

Upload: david-portnoy

Post on 14-Apr-2017

338 views

Category:

Technology


0 download

TRANSCRIPT

Page 1: DDOD for FOIA organizations

Demand-Driven Open Data

More info:Contact

http://[email protected] | tw:@DPortnoy

DDOD for FOIA Organizations(Freedom of Information Act)

Originally presented in March 2015 to FOIA directors meeting for all agencies under HHS

Concepts explored here do not necessarily represent the views of HHS. 9/2015

Page 2: DDOD for FOIA organizations

Abstract

Industry and researchers can get valuable data through DDOD (Demand-Driven Open Data) and FOIA (Freedom of Information Act). Both paths have their advantages and limitations. A subset of FOIA requests can also be worked as DDOD "use cases". Such requests involve structured, machine-readable datasets that continue to be generated and are not for restricted data. This paper explains how combining the two methods by working this subset of requests in parallel significantly improves the effectiveness both DDOD and FOIA.

This paper was originally presented at the FOIA directors meeting for all HHS (U.S. Dept of Health and Human Services) agencies in March 2015.

Page 3: DDOD for FOIA organizations

Demand-Driven Open Data (DDOD) is a framework of tools and methods that…

Provide external data users✽ with a systematic, ongoing and transparent way to tell HHS what data they need

...To be managed, measured and executed in terms of use cases, enabling allocation of limited resources based on value

What is DDOD?

✽ Including industry, researchers, nonprofits, media and other government organizations

More effective open data initiativesMore engaged user communityMore economic value & discoveries delivered

Page 4: DDOD for FOIA organizations

Add

What does DDOD deliver to the user community?Implementation of a use case could fall into one of 3 categories

Time to execute

Cos

t / E

ffort Improve

Catalog

Facilitate deployment of ● New datasets● New APIs

For existing datasets● Add needed fields● Improve data quality● Add / improve metadata● Add / improve API

If datasets already exist in legacy systems, make them more available and discoverable

● Publicize availability● Index to HealthData.gov and Data.gov● Instructions for efficient FOIA handling

Current State

Page 5: DDOD for FOIA organizations

Processes for administration of use cases, such as • Encouraging responsiveness, transparency and documentation • Ensuring use cases and resulting datasets are indexed in HealthData.gov

Specialized tools for administering use cases • Workflow engine, communications method, knowledge base • Data processing, storage, hosting, versioning

Proactive outreach to industry and academia for a thriving community

What does DDOD provide to data owners?

1

2

3

Page 6: DDOD for FOIA organizations

“The Freedom of Information Act (FOIA) is a law that gives you the right to access information from the federal government. It is often described as the law that keeps citizens in the know about their government.”

What is FOIA?

Source: FOIA.gov

Page 7: DDOD for FOIA organizations

U.S. Department of Health & Human Services

What does FOIA look like within HHS?

Source: FOIA.gov

Agency Received Processed Pending

ACF 1,864 1,255 763

CMS 26,361 25,027 4,717

OIG 843 815 34

CDC 1,141 1,028 691

FDA 10,224 10,191 2,896

HRSA 355 352 92

IHS 152 160 20

NIH 1,169 1,156 84

SAMHSA 193 238 51

OS 1,531 1,605 314

OASH 437 455 120

ACL 13 11 2

Total 44,283 42,293 9,784

Page 8: DDOD for FOIA organizations

Some FOIA requests look quite similar to DDOD use cases

FOIA requests that have all of the following attributes could be potential DDOD use cases

1. The data is still being generated or refreshed on a regular basis. (Not a single pull of historical information.)

2. The data can be delivered in a machine readable, structured format, such as CSV, JSON, XML or Excel files. (Not freeform text-only, PDF, or scanned images.)

3. The data has widespread usefulness for multiple organizations. (Excluding complaints and investigations, which are often specific to an individual or organization, as is often the case in lawsuits.)

Page 9: DDOD for FOIA organizations

Although only a fraction of the 45,000 FOIA requests received so far by HHS have attributes that make them applicable to DDOD, there are still plenty that could become high value use cases.

Page 10: DDOD for FOIA organizations

But both FOIA and DDOD have their own challenges DDOD’s challenges:

● DDOD is a relatively new program that doesn’t enjoy the awareness, recognition and credibility that FOIA has developed over the years

● Proactive outreach to the user community is a big effort and expense for DDOD

● The toolset for managing use cases is early in its lifecycle and is still being developed

● While the DDOD workflow is documented and actively in use, it’s still being fine-tuned

FOIA’s challenges:

● Growing backlog, due to high demand, complexity of some requests and limited resources

● Process for spotting similar requests and posting the response to an electronic “reading room” is subjective and time consuming

● Difficult to maintain consistency in both format and actual data delivered. Requests may be subject to interpretation. Each request may be fulfilled by a different employee or contractor

● There’s no mechanism for automatically refreshing results for data that continues to be generated

Page 11: DDOD for FOIA organizations

Joining forces on applicable requests is mutually beneficial, because it can help address many of these challenges.

Page 12: DDOD for FOIA organizations

DDOD can benefit FOIA by...1. Reducing number of future requests

More data indexed and made publically available via HealthData.gov (See diagram)

Data that’s automatically being refreshed doesn’t need to be requested multiple times

Rework stemming from inconsistent format and data values between consecutive extracts on refreshed data

Better documentation of data provenance and usage via use case methodology

Data User runs search on HealthData.gov

Data User creates / updates use case

DDOD Admin engages Data Owner on use case

• keywords, subject • data dictionary

HealthData.gov

Data Dictionary

DatasetInventory

EDI*

Use Case

DDOD Admin ensures changes to EDI get propagated to HD.gov

* Enterprise Data Inventory (EDI), which is a catalog of HHS “Strategically Relevant Data Assets”

Data Dictionary

DatasetInventory

DDOD Admin enters use case on HD.gov with links to specifications

Data Owner adds entry to EDI, including metadata

DDOD Admin curates entry & ensures SLAs

DDOD Admin creates repository for use case

Process of adding a new DDOD use case

Page 13: DDOD for FOIA organizations

DDOD can benefit FOIA by… (cont)

2. Fulfilling the electronic “reading room” requirement for requests made multiple times by publishing the data using HealthData.gov infrastructure. Results end up benefiting more users.

3. Instituting best practices for open data projects in terms of machine readability, standards, documentation and access

4. Helping address the backlog, specifically for the subset of requests applicable to DDOD. These are also the requests that are beneficial to a wider audience.

Much of the current backlog is unrelated to machine readable datasets that serve the broader user community. FOIA is straining under the weight of one-time requests that benefit only the few. For example, a single lawsuit-related request to CDC required processing of 90,000 documents.

Page 14: DDOD for FOIA organizations

FOIA can benefit DDOD by...

1. Since FOIA is legislated and well-known, it can bring in more use cases into DDOD.

By identifying specific requests that are applicable to DDOD

By promoting DDOD on FOIA websites and materials

2. DDOD can leverage work being done anyway to fulfill FOIA requests

Data owner already needs to be involved and resources -- whether employees or contractors -- need to be allocated to do the extract

Once a data extract is performed, it might be trivial to configure periodic runs of the same code and perform automated pushes to HealthData.gov

3. Association with FOIA lends additional awareness, recognition, credibility to DDOD

Page 15: DDOD for FOIA organizations

Requester receives

documents

Requester can sue

Data owner produces

documents

Submit for fulfillment to data owner

Requester can appeal

We start with a simplified overview of the FOIA workflow to determine how DDOD requests could be initiated and executed in parallel

Agree to fulfill

Requests that are partially or incorrectly fulfilled can also go through the same appeals process

Agency decides whether and what

to release

Agency obtains clarification (if needed)

Request routed to ultimate

agency

Respond within 20 business days

(Can extend for unusual circumstances: Research, volume, consultation)

Route within 10 business days

Decline request Agree to

fulfill

Decline request

Requester submits request

to FOIA

Page 16: DDOD for FOIA organizations

In DDOD workflow, use case execution relies on three types of participants: Data User, DDOD Admin, and Data Owner. The Data User initiates the request

All implementation decisions ultimately are made by the Data Owner

The DDOD Admin only facilitates the process when needed

Page 17: DDOD for FOIA organizations

Although it’s a legislation-driven process, FOIA organizations may voluntarily notify DDOD on incoming requests. For such requests, the DDOD workflow works in parallel to the FOIA workflow.

Dat

aU

ser

Dat

aO

wne

rD

DO

D

Adm

in

Document ImplementEngage Discuss

Document Engage Clarify

Document Implement Discuss Clarify

FOIA

Org Workflow for FOIA - DDOD partnership

Typical FOIA request