downloading from the seamless data distribution …€¦ · web viewflowerdew, r., and m. green....

39
The Yale Map Collection GIS Workshop #3 Finding GIS Data & Preparing It for Use Census Data Download and Import into ArcGIS Goals of this Tutorial: To ensure that you can download both cartographic and demographic data from the U.S. Bureau of the Census in a form that can be imported into ArcGIS. To ensure that you know how to join attribute data to cartographic data. 1) First, if you are using Internet Explorer, you must add the census sites to your trusted sites list to allow

Upload: others

Post on 11-Jun-2020

1 views

Category:

Documents


0 download

TRANSCRIPT

Page 1: Downloading from the Seamless Data Distribution …€¦ · Web viewFlowerdew, R., and M. Green. "Developments in Areal Interpolation Methods and GIS." The Annals of Regional Science

The Yale Map Collection

GIS Workshop #3

Finding GIS Data & Preparing It for Use

Census Data Download and Import into ArcGIS

Goals of this Tutorial: To ensure that you can download both cartographic and demographic data from

the U.S. Bureau of the Census in a form that can be imported into ArcGIS. To ensure that you know how to join attribute data to cartographic data.

1) First, if you are using Internet Explorer, you must add the census sites to your trusted sites list to allow downloads and pop-ups for this exercise. In Internet Explorer, go to Tools>Internet Options and click on the Security Tab. Highlight the green Trusted Sites icon, and click the Sites… button. Uncheck the “Require server verification…” radio button. Add census.gov and factfinder.census.gov to the Trusted Web Sites list. Click OK twice to exit the Security setting dialog.

Page 2: Downloading from the Seamless Data Distribution …€¦ · Web viewFlowerdew, R., and M. Green. "Developments in Areal Interpolation Methods and GIS." The Annals of Regional Science

Downloading and Pre-processing Census Tract Data from the Internet

2) Create a directory called Census_Data_Download in your working directory (it is always a good idea to place your working directory on the C: Drive, and without spaces or special characters in the folder name).

3) Go to www.census.gov.4) Click on “Geography”. 5) Click on “Census 2000 Geographic Products”. 6) Click on “ Census 2000 Map Series ” under the Census 2000 Maps heading. 7) Click on “BOUNDARY FILES.” 8) Click on the “Download Boundary Files” icon on the left hand side. 9) Click on “2000” next to the words “Census Tracts.” (Note that you are

provided with MANY possible Census Boundary File choices on this page. However, the census does not provide access to the Block level boundary file from the website, although these can be obtained from the ESRI Data & Maps CD Set, or from the ESRI Download Website.)

10) Scroll down to “Census 2000: Census Tracts in ArcView Shapefile (.shp) format.”

11) Click on “Connecticut - tr09_d00_shp.zip (220,667 bytes).” When prompted, save this file to your \Data\Shapefile folder.

12) Next navigate to the \Data\Shapefile folder, right-click on the file you just downloaded and “Extract All.”

13) Make sure you have the three files with the extensions .dbf, .shx, .shp.

Page 3: Downloading from the Seamless Data Distribution …€¦ · Web viewFlowerdew, R., and M. Green. "Developments in Areal Interpolation Methods and GIS." The Annals of Regional Science

Subsetting the Census Boundary File to an Area of Interest (AOI)

The next step is to extract New Haven county census tracts from Connecticut state data using the FIPS (Federal Information Processing Standard) code attribute.

1) Open ArcMap with a new empty map and add the Connecticut census tracts to your map.

2) Save your map in your Census_Data_Download directory as “Census_Data_Download.mxd”

3) Go to the FIPS PUB 6-4 Lookup website to find out which FIPS code is associated with New Haven County (in this case, its 009).

4) In the Main Menu of ArcMap, Go to Selection>Select by Attributes.5) Enter (or click on the appropriate Field name, operator, and value)

"COUNTY" = '009' in the query dialog box. Click Apply and then Close.6) The census tracts for New Haven County will be highlighted.7) Right click on tr09_d00 layer in the Table of Contents on the left and Select

Data>Export Data. 8) The Export: Selected Features option will be the default, since there is an active

selection in the Map Document. Browse to your working directory and Save the Export Shapefile as New_Haven_Tracts.shp. Click OK.

9) Click Yes when prompted to add the new shapefile to the map as a layer.10) Right-Click on the tr09_d00 layer and choose Remove to remove it from the

current project.11) Click on the Full Extent button to zoom into New Haven County.12) Right-Click on New_Haven_Tracts layer and Open the Attribute Table.13) Notice that you have 184 features. Notice that you have almost no attribute data,

other than numerical identifiers.14) Notice that you do have a column called Tract. This is going to be our Key Field

that we use to join attribute data to our geography.15) Save your map.16) If your data looks like the picture below… Congratulations!, you got the

cartographic data that you need.

Page 4: Downloading from the Seamless Data Distribution …€¦ · Web viewFlowerdew, R., and M. Green. "Developments in Areal Interpolation Methods and GIS." The Annals of Regional Science

Downloading and Pre-Processing Attribute Data

In this section we are going to download attribute data that pertains to the population that lives in the census tracts for which you downloaded boundaries in the previous section. The attribute data that we want is the racial split of the population. After we download the data we are going to edit it Excel in order to import it into the GIS, and we will do it in such a way that we can assign each census tract its appropriate population/race values.

1) Go to http://www.census.gov/2) Click on “American FactFinder” on the left side column.3) Click on “get data” under Decennial Census.4) Make sure the radio button for “Census 2000 Summary File 1 (SF 1) 100-

Percent Data” in the middle of the page.5) In the “Select from the following” column in the right side of the page, select

“Detailed Tables”.

Page 5: Downloading from the Seamless Data Distribution …€¦ · Web viewFlowerdew, R., and M. Green. "Developments in Areal Interpolation Methods and GIS." The Annals of Regional Science

6) In the new page under “Select a geographic type” choose “Census Tract” from the Dropdown menu. Select “Connecticut” from the state dropdown menu and then choose “New Haven County” from the county dropdown menu.

7) Under “Select one or more geographic areas…” menu select “All Census Tracts” and then click the “Add” button. Highlight the first “Census Tract 0” item and click Remove to remove it from the list. Click the “Next” button.

8) In the “Select one or more tables…” field choose table “P3. Race” and click on “Add”. Then click “Show Result”

You now have a table of attributes where each column is a census tract and each row is a different racial attribute. The page shows only the first 10 census tracts, but you can scroll to the “next” page if you want to see more. We won’t be using all of the attributes (rows), but we will need to download the entire table and then cut out what we don’t need.

However, we are still missing one important element. We need a common identifier for each census tract that we can use to join this table with our table in the GIS.

9) Under “Options” on the top of the page choose “Show Geographic Identifiers”.

Now you see two tables of information (if you scroll down). The first table shows all of the geographic identifiers for each tract, and the second table is the table of racial attributes that we saw before. We are going to download both of these tables so that we can edit them.

10) Under the “Print/Download” at the top of the page, choose “Download” to start downloading the table.

11) When the download table opens, scroll down to the section with the “Database compatible (data rows only)” heading:

a. Choose the Comma delimited (.txt).b. Uncheck the “Include descriptive data element names” box.c. Before you download the data set you have created, you should click on

Technical Documentation (PDF) link, and save the file to your working folder. This PDF file continas information about the SF1 Data, including how the data is collected and a Data Dictionary.

d. Click “OK” to download the data file.12) When the Save File dialog opens, Save the “output.zip” file to your \Data\

Tables folder.13) Browse to the folder you saved the output.zip file in and extract it to that folder.

This should result in four files:1. dt_dec_2000_sf1_u_data1.txt2. dt_dec_2000_sf1_u_geo.txt3. dt_readme.txt4. readme_dec_2000_sf1.txt

Page 6: Downloading from the Seamless Data Distribution …€¦ · Web viewFlowerdew, R., and M. Green. "Developments in Areal Interpolation Methods and GIS." The Annals of Regional Science

14) Start Microsoft Excel and Browse to the folder that you extracted your attribute data to. Change the “Files of Type” drop-down to “All Types” and Open the dt_dec_2000_sf1_u_data1.txt file.

15) In the Text Import Wizard, change the ‘Original Data Type” to Delimited, then click Next. Check the “Comma” radio button and click Finish

16) Click on the A column identifier at the top of the table. Scroll to the right and Shift-Click on X to select all the columns from A to X. Right-Click on the X and select Delete.

17) Repeat the above step to remove columns B through AO.

18) Repeat the above step for columns E through W.

19) In the Main Menu, select Save As. Browse to the folder where you extracted the data (if it is not already there), and change the “Save as Type” drop-down to CSV (Comma Delimited). Click Save, and Yes when warned about changing the Data Type.

20) Close Excel and Click NO when prompted to save changes.

Opening the Attribute Table in ArcMap

1) Return to ArcMap2) Click the Add Data button and browse to the folder that you extracted the Census

Attribute Data to.3) Add the dt_dec_2000_sf1_u_data1.csv file to ArcMap.4) Notice that your Table of Contents View changes to the Source Tab. This is

because you have added data that has no explicit geographic display.

Preparing the Key Field in the Tract Boundary file for Joining to the Attribute File

Like many things the government does the census data is not perfect and must be altered before it can be joined. The problem is that the TRACT records in the two separate tables are not formatted in the same way. We will need to alter these records to provide ourselves with identically formatted and recorded records

1) Right-Click on the dt_dec_2000_sf1_u_data1.csv table and OPEN it.2) Right-Click on the New_Haven_Tracts layer and Open the Attribute Table.3) Scroll to the right in both tables and find the TRACT field in each one.

Page 7: Downloading from the Seamless Data Distribution …€¦ · Web viewFlowerdew, R., and M. Green. "Developments in Areal Interpolation Methods and GIS." The Annals of Regional Science

4) Note that in the dt_dec_2000_sf1_u_data1.txt table the TRACT entries are recorded with 6 significant places, while in the New_Haven_Tracts table the TRACT entries are recorded either with 4 or 6 significant places.

The explanation for the way the TRACT number is recorded in the dt_dec_2000_sf1_u_data1.txt table lies in the way that the Census Bureau creates new Census Tracts in the face of increasing population. When a Census Tract becomes larger than the Census finds is appropriate, that Tract is typically ‘split’ into two Tracts. When this happens, the Tract Number is appended with a suffix of 01, 02, 03, etc… (Depending on the number of times a Tract has been split).

What we need to do is create a Field in one of the tables that matches exactly a Field in the other table. These two identical fields can then be used to join the two datasets. In this case, there is a field in the Tracts Boundary file called NAME, in which the Census Tracts are recorded with the suffixes appearing after a decimal

Page 8: Downloading from the Seamless Data Distribution …€¦ · Web viewFlowerdew, R., and M. Green. "Developments in Areal Interpolation Methods and GIS." The Annals of Regional Science

point. This means that we can simply multiply this field by 100 to arrive a value that corresponds to the Tract values in the dt_dec_2000_sf1_u_data1.txt table.

5) In the New_Haven_Tracts Attribute Table, click on the Options button and select Add Field.

6) Name the new field KEY_TRACT, and give it the Type: LONG INTEGER.

7) Right-Click on the Field Header for the new KEY_TRACT field and Select Calculate Values.

8) In the Field Calculator, enter the following argument:

100 * (CDbl ( [NAME] ))

In this case, the field NAME, in the New_Haven_Tracts table, is formatted as a STRING so that we must convert the STRING value in the field to a Double Value (Using the CDbl() function) before we can multiply it by 100.

Click OK to calculate the new values for the KEY_TRACT field.

9) You can now close both Attribute Tables, if you have not already.

Joining the Attribute Data to the Boundary File

1) Right-Click on the New_Haven_Tracts layer in the Table of Contents and Select Joins and Relates>Join…

2) Assign the values to the Join Data dialog box as shown below:

a. Join attributes from a table

Page 9: Downloading from the Seamless Data Distribution …€¦ · Web viewFlowerdew, R., and M. Green. "Developments in Areal Interpolation Methods and GIS." The Annals of Regional Science

b. …join will be based on: KEY_TRACTc. …the table to join…: dt_dec_2000_sf1_u_data1.csvd. …field in the table to base the join on: TRACT

3) Click OK to apply the Join.4) Open the Attribute Table for the New_Haven_Tracts layer and note the

attribute data has now been joined to the boundary file.

If we would like to make the attribute data a permanent part of the boundary file, we can do so by exporting the Joined layer to a new shapefile.

5) Right-Click on the New_Haven_Tracts layer and go to Data>Export Data…6) Make sure Export:All features is selected. Browse to your working folder and

name the Export shapefile New_Haven_Tracts_Attribs. Click OK and Select Yes when prompted to add the layer to your map layout.

7) Open the Attribute Table of the new layer and note that the New_Haven_Tracts prefix has been removed from the field names in the Attribute Table. This is because the data is no longer “joined,” but is now part of the shapefile we have created.

Page 10: Downloading from the Seamless Data Distribution …€¦ · Web viewFlowerdew, R., and M. Green. "Developments in Areal Interpolation Methods and GIS." The Annals of Regional Science

Finally

The procedure outlined here can be used to download any of the census boundary files and associate those boundaries with census attribute data, except in the case of census blocks. The census block boundary files can be downloaded from the ESRI Census Data Download website. ESRI provides the boundary and attribute data separately, just as the census does; however, the ESRI Census Data contains geographic identifiers that have already been altered to make them comparable for joining attributes to boundaries.

Additional Suggested Reading:

Brewer, C., and T. Suchan. Mapping Census 2000: The Geography of U. S. Diversity. ESRI Press, 2001.

Bureau, U. S. C. Summary File 1 Technical Documentation: 2000 (2003) Bureau, U. S. C. Census of Population and Housing, Summary File 3: Technical

Documentation (2000) Bureau, U. S. C. "Census 2000 Basics." Washington DC: US Government

Printing Office (2002)

Overlay of Demographic Datasets with Unlike Boundaries

IntroductionThe U.S. Census is one of the most often used datasets on earth. It provides a

convenient (mostly) source of hundreds of demographic variables at multiple spatial resolutions, updates at regular intervals. However, researchers often want to use census demography with other types of features that do not overlay with the census boundaries exactly. This tutorial presents a procedure for applying census demography to features of interest using “Areal Weighting.”

Tutorial Data

The data for this tutorial can be downloaded from: http://www.library.yale.edu/MapColl/Downloads/Workshops/Finding_Data_Workshop03_Data.zip

Page 11: Downloading from the Seamless Data Distribution …€¦ · Web viewFlowerdew, R., and M. Green. "Developments in Areal Interpolation Methods and GIS." The Annals of Regional Science

In this Tutorial, we will use two datasets as base data:1. CTMajorbasins.shp – This shapefile contains the watershed boundaries of

Connecticut. Note that these boundaries extend beyond the Connecticut state boundaries, so that they include the entire watershed, independent of political boundaries. Additional information may be found here.

2. CTblkgrp.shp – This shapefile contains a subset of the U.S. Census Block Groups, which provides boundaries and demographic information for Census block groups within United States. This subset was created by selecting and exporting all Census Block Groups that intersect with the Majorbasins.shp shapefile.

Getting Ready1. Browse to the folder you unzipped the tutorial materials to and look for a folder

called \ Finding_Data_Workshop03_Data. Open this folder, and then open the Census_Overlay_Tutorial.mxd.

2. You should get something like this:

Page 12: Downloading from the Seamless Data Distribution …€¦ · Web viewFlowerdew, R., and M. Green. "Developments in Areal Interpolation Methods and GIS." The Annals of Regional Science

Areal Weighted Interpolation

The ProblemIn this exercise, we are interested in finding out the total population within each

of our major watersheds. Our problem is that the boundaries of our watersheds to not correspond with our census boundaries. Here, we will assign a proportion of the census population count for each census block group based upon the area of each block group that falls within each of the watershed features.

First, we would like to take a closer look at what the problem is.

1. From the Main Menu of ArcMap, select View>Bookmarks>AOI #1

Note that the watershed boundaries (in blue) do not correspond spatially with the census block group boundaries. In this area, we note that the block group is split between two watersheds, and an area outside our area of interest. We want to assign population values from this block group (and all others), based upon the area that falls within each of our features of interest.

Page 13: Downloading from the Seamless Data Distribution …€¦ · Web viewFlowerdew, R., and M. Green. "Developments in Areal Interpolation Methods and GIS." The Annals of Regional Science

Calculate the Area for Census Block GroupsThe first thing we need to do is establish the area of each of the census block

groups accurately. 1. Open your ArcToolbox pane, if it is not already open. Click on the Search Tab

at the bottom of the pane and Search for “Areas.”2. Double-Click on the Calculate Areas tool.3. Select CTblkgrp as your Input Feature Class. For the Output Feature Class,

Browse to the folder you are working from, and name the Output “CTblkgrp_Area.” Click OK.

4. The Calculate Areas tool will place the resulting shapefile in the Table of Contents. Right-Click on the CTblkgrp_Area layer and Open the Attribute Table.

5. Scroll to the right and note that the Calculate Areas tool has added a new field called “F_AREA,” which contains the area of each census block group feature, measured in square meters.

Preserving the Original Area ValuesBecause we will use the Calculate Areas tool again in this procedure, which will

overwrite the “F_AREA” field (if it exists), we need to create a new field to place these values into.

Page 14: Downloading from the Seamless Data Distribution …€¦ · Web viewFlowerdew, R., and M. Green. "Developments in Areal Interpolation Methods and GIS." The Annals of Regional Science

1. At the bottom of the Attribute Table window, Click the Options buttons and select Add Field.

2. Name the new field “ORIG_AREA” and Assign the Type as “Double.” Click OK.

3. Right-Click on the “ORIG_AREA” field header and Select “Calculate Values.”4. In the Field Calculator, under the “Fields:” list box, Scroll down and double-

click on “F_AREA” to add this value to the argument pane. Click OK to apply this calculation. Note that the “F_AREA” values have been transferred to the new field.

5. Close the Attribute Table.

Merging the Two Boundary FilesNow we would like to merge the two boundary files. This is done in order to

allow us to measure the area of the portions of each census block group that partially falls within each watershed boundary. This will also assign the appropriate watershed name to

Page 15: Downloading from the Seamless Data Distribution …€¦ · Web viewFlowerdew, R., and M. Green. "Developments in Areal Interpolation Methods and GIS." The Annals of Regional Science

each of the census block group portions, allowing us to later create a summary population statistic for each of the watersheds.

1. Go to the ArcToolbox Search Tab again. Search on the term “Union” and open the Union tool.

2. Use the CTblkgrp_Area and the CTMajorbasins as the Input Features. Browse to your working folder and name your Output Feature Class “CTBasin_BlkGrp_Merged.shp.” Leave all other fields as their default values. Click OK to apply the Union tool.

3. What you are left with should look something like this:

Page 16: Downloading from the Seamless Data Distribution …€¦ · Web viewFlowerdew, R., and M. Green. "Developments in Areal Interpolation Methods and GIS." The Annals of Regional Science

Note that what results is a layer that combines the boundaries of the two datasets. Likewise, if you examine the Attribute Table you will notice that each feature contains the attributes of both datasets, where they intersect. The problem is that the individual Block Groups that have been split by a watershed boundary have been given the Total demographic values for the original block group, from which they have been created. What we want to do is calculate a new demographic variable based upon the proportional area of the new feature to its parent feature.

Calculating the Areas for the New Features

1. Return to ArcToolbox and open the Calculate Areas tool again.2. Select CTBasin_BlkGrp_Merged as your Input Feature Class. For the Output

Feature Class, Browse to the folder you are working from, and name the Output “CTBasin_BlkGrp_Merged_Area.” Click OK.

Page 17: Downloading from the Seamless Data Distribution …€¦ · Web viewFlowerdew, R., and M. Green. "Developments in Areal Interpolation Methods and GIS." The Annals of Regional Science

3. Open the Attribute Table for the resulting Layer, CTBasin_BlkGrp_Merged_Area. Scroll to the right and notice that the F_AREA field has been overwritten with the are value for the new features. In some cases, the area has changed, and in some cases it is the same. This is because some, but not all, of the census block groups fall completely within a single watershed, and were therefore not split.

Calculating the Proportion

1. With the Attribute Table still open, Click on the Options button and Select Add Field.

2. Name the new field “AREA_PROP” and Assign the Type as “Float.” Click OK.

3. Now, scroll to the right again and Right-Click on the ORIG_AREA field header and Select “Sort Ascending.” Note that there are a number of records with 0 values for the ORIG_AREA. These are “Slivers” on the coast that do not fall within the census boundaries, and therefore do not have an ORIG_AREA value. These 0 values will cause “division by 0” errors in the field calculator, and do not contain demographic data, anyway. We will “select them out” of our proportion calculation.

4. Click on the Options button at the bottom of the Attribute Table and open the Select by Attributes dialog.

5. In the Query window, enter the following: "ORIG_AREA" = 0

Page 18: Downloading from the Seamless Data Distribution …€¦ · Web viewFlowerdew, R., and M. Green. "Developments in Areal Interpolation Methods and GIS." The Annals of Regional Science

6. Click Apply, and then Close the Select by Attributes dialog.7. Go to the Options Button and select Switch Selection, to invert the selection you

just made. You will be warned about using the Switch Selection operation on a large number of records. You can disregard this and Click Yes.

8. Click on the Show: Selected button. This will “hide” the 0 value records from you AND the field calculator.

9. Scroll to the right again and Right-Click the AREA_PROP field header and Select Calculate Values.

10. In the Field Calculator, enter the following: [F_AREA] / [ORIG_AREA]

Page 19: Downloading from the Seamless Data Distribution …€¦ · Web viewFlowerdew, R., and M. Green. "Developments in Areal Interpolation Methods and GIS." The Annals of Regional Science

11. Click OK to apply the calculation.12. Click on the Show: All button at the bottom of the Attribute Table to show all

the records. 13. In the ArcMap Main Menu, go to Selection>Clear Selected Features to clear

the previous selection. 14. Scroll up and down through the attribute table and notice that many of the records

in the new AREA_PROP field have a value of 1, while the rest have a value of less than 1, indicating that these features are a product of the union of the block group and watershed layers.

Calculating the Areal Weighted Demographic Variable

Now that we have calculated the area of each of the new features in proportion to the original block groups, we can assign a demographic variable to those new features, based upon this proportion.

1. With the Attribute Table still open, Add a New Field and Name it POP2004_WT, give it the Type: Float.

2. Right-Click on the Field Header of the new POP2004_WT and Select Calculate Values.

3. Enter the following argument in the Field Calculator: [POP2004] * [AREA_PROP]

Page 20: Downloading from the Seamless Data Distribution …€¦ · Web viewFlowerdew, R., and M. Green. "Developments in Areal Interpolation Methods and GIS." The Annals of Regional Science

4. Click OK to apply the calculation. You should now have a proportional value for the 2004 population in each of your new features.

Aggregating the Demographic Variable Using the Dissolve Tool

We now have a 2004 Population count for each of the new features, calculated based upon the proportion of the new feature area to its parent feature. What we WANT is the 2004 population count for each of the watershed features we started with. To do this, we need to aggregate the new POP2004_WT variable based upon which watershed each records corresponding feature falls within. To do this, we will use the Dissolve Tool.

1. If you have not already, close the CTBasin_BlkGrp_Merged_Area Attribute Table.

2. Click on the ArcToolbox Search Tab and search on the term “Dissolve.” Open the Dissolve Tool.

3. Select the CTBasin_BlkGrp_Merged_Area Layer as your Input Features. Browse to your working folder and save the result as CT_Basin_POP2004. Select MAJOR as the Dissolve Field (this is the field that indicates the name of the watershed). In the Statistics Field dropdown, select POP2004_WT, and assign its Statistic Type = SUM. Click OK.

Page 21: Downloading from the Seamless Data Distribution …€¦ · Web viewFlowerdew, R., and M. Green. "Developments in Areal Interpolation Methods and GIS." The Annals of Regional Science

4. Right-Click on the CT_Basin_POP2004 and Open the Attribute Table. Notice that there are now only four Fields in the Attribute Table, including the MAJOR field, which indicates each of the watersheds in Connecticut, and the SUM_POP200 (the Field Name size limit has caused ArcMap to crop the Field Name to 10 characters, after appending SUM to the name), which provides the population count for each watershed, aggregated from the portions of the Census block groups that fall within each watershed.

Page 22: Downloading from the Seamless Data Distribution …€¦ · Web viewFlowerdew, R., and M. Green. "Developments in Areal Interpolation Methods and GIS." The Annals of Regional Science

Cleaning up

Finally, we might want to end up with a shapefile that only contains the original watershed features of our CT_Basins shapefile. To end up with this result, we simply need to ‘trim’ our CT_Basin_POP2004 to remove the features that lie outside the original watershed layer. This is easily done, since the parts we want to trim are the ones that do not have an entry for the field MAJOR.

1. With the Attribute Table for CT_Basin_POP2004 open, simple click on the small gray square at the far left of the record/row that has no entry for the field MAJOR.

2. Click on the Options button and select Switch Selection. Close the Attribute Table and note that the watershed features are selected in your Map Document.

3. Right-Click on the CT_Basin_POP2004 layer and Select Data>Export Data.4. ‘Selected Features’ will be the default Export, since you have an active

selection. Browse to your working folder and name your export shapefile “CT_Basins_POP2004_Final.” Click OK, and add the export as a layer in your map.

5. Go to Selection>Clear Selected Features.

Finally

This tutorial has provided a step-by-step guide to the interpolation of demographic variables for geographic boundaries that do not correspond to U.S. Census boundaries that these variables are provided in. It is important to note that the method described here is the simplest of many for this type of interpolation. There are many caveats to be considered when utilizing this and other methods of demographic overlay for research purposes. Perhaps the most important point to be made is that demography

Page 23: Downloading from the Seamless Data Distribution …€¦ · Web viewFlowerdew, R., and M. Green. "Developments in Areal Interpolation Methods and GIS." The Annals of Regional Science

is not necessarily uniformly distributed within the geographic entities that census data collection is based upon. This means that apportioning demographic variables based solely upon areal proportion does not provide perfect results. There are other, more complex methods of interpolating demographic variables to non-census boundaries that provide more accuracy by accounting for the density of infrastructure (streets & intersection) as a way to weight the differential distribution of population within census boundaries.

Additional Suggested Reading:

Sadahiro, Y. "Accuracy of Areal Interpolation: A Comparison of Alternative Methods." Journal of Geographical Systems 1.4 (1999): 323-46.

Flowerdew, R., and M. Green. "Developments in Areal Interpolation Methods and GIS." The Annals of Regional Science 26.1 (1992): 67-78. Yale Fulltext

[PDF] When Census Geography Doesn 't Work

Page 24: Downloading from the Seamless Data Distribution …€¦ · Web viewFlowerdew, R., and M. Green. "Developments in Areal Interpolation Methods and GIS." The Annals of Regional Science

Other Data Downloads and Preparation Skills

Downloading from the Seamless Data Distribution Website

The Seamless Data Distribution Website is maintained by the USGS and is the primary clearinghouse for raster data produced by the USGS, including ortho-imagery, Digital Elevation data, mosaics of Landsat imagery and more. Once familiar with the interface for the Seamless site, you should be able to successfully navigate and use many other ArcIMS-based data sites, as they are based upon the same architecture. In this exercise you will download elevation data for the New Haven, CT area.

1. Go to the USGS Seamless Data Distribution Website at http://seamless.usgs.gov/website/seamless/ and click on the “View and Download United States Data,” link at the right side of the page. Wait for the Map to load.

2. By default, the Zoom Tool will be active. Use it to Zoom into New Haven, CT. This may take several Zooms.

3. On the right side of the Seamless Application, look for the “Download” tab and activate it. Scroll down the list (noting the available downloads), find the Elevation Group and expand it (if not already) by clicking on the triangle next to the word Elevation.

Page 25: Downloading from the Seamless Data Distribution …€¦ · Web viewFlowerdew, R., and M. Green. "Developments in Areal Interpolation Methods and GIS." The Annals of Regional Science

4. Make sure the 1” NED item is checked, and that no other items are enabled. NED is the National Elevation Dataset, which is a raster dataset that describes the elevation at any given point, at several different resolutions (in this case ~30 meters). This type of data is commonly referred to as a Digital Elevation Model, or DEM. Now click on the 1” NED layer name to open a description of the layer in a new window.

5. Enable the Define Download Area Tool , under the Downloads Menu and Drag a box across the city of New Haven to define area of interest (AOI). Note that the box will remain green unless the area you have defined becomes larger that the Seamless Site allows (you can request up to 1.6GB, in 100MB files at once).

Once you release the mouse button, a new page will be opened in a new browser window (you may have to enable pop-ups for the site). This new window will provide some general info about the data you have requested (projection, bounding coordinates, cell size, etc…) as well as Download Link Buttons to begin downloading the data to your hard drive.

6. Click on the Download Button to post the data request to the server. Yet another window will be opened, which indicates the Current Status of your request on the server.

7. When the Save As… dialog box opens, browse to the \Data\Raster folder to save the file. When the download is complete, browse to find the ZIP file you just downloaded, right-click on it and “Extract All.”

8. The data will be extracted to a new subfolder called something like “ned_70561049.” In the rest of this tutorial, this layer will be referred to as ned_#######.

Applying an Appropriate Projection to You Data

Data does not always come in a ‘ready to use’ format. One common necessity is to apply a “Projection” to the data, in order to transform the spatial reference from angular Lat/Long coordinates to planar/linear units, such as feet or meters. Here, you will “project” your Digital Elevation Model to a projection that is appropriate for applying calculations that assume the data is recorded in linear units.

9. Open ArcMap. Use the Add Data button to browse to the folder containing your elevation data. There should be two files there, one a polygon shapefile named “METADATA.shp,” (which contains the footprint of your elevation data, and its metadata), as well a raster layer with a name similar to the folder that

Page 26: Downloading from the Seamless Data Distribution …€¦ · Web viewFlowerdew, R., and M. Green. "Developments in Areal Interpolation Methods and GIS." The Annals of Regional Science

contains it. Select the ned_######## layer and add it to ArcMap.

10. Right-click on the ned_######## layer and Open its Properties Dialog box. Select the Source Tab and scroll down to the Spatial Reference information.

11. Note that there is no Linear Unit assigned to this data. Scroll back up to the top and look at the Cellsize (X,Y) item. This data has a Cellsize of about 30 meters, but here it is notes as 0.0002777777777999463. This is because the data has not been projected and is currently spatially referenced in latitude & longitude coordinates, which are angular units of measurement. Click OK to close the Properties Dialog.

12. Open the ArcToolbox and search for “Hillshade.” Open the Hillshade tool and use your ned_######## Layer as the Input Raster. Name the Output raster “hillshade01” and place it in the \Data\Raster folder. Change the Z factor option to 3 (this exaggerates the elevation for a better visual quality). Click OK to apply the tool.

Page 27: Downloading from the Seamless Data Distribution …€¦ · Web viewFlowerdew, R., and M. Green. "Developments in Areal Interpolation Methods and GIS." The Annals of Regional Science

13. When the tool is finished running, you should see a new layer in your Map View window. However, the effect is has produced is not very attractive. The Hillshade layer we have produced is very dark, and the topography it has created seems far more “extreme” than we might have expected.

These poor results are related to what we observed earlier in the Spatial Reference and Cellsize of our Digital Elevation Model. Creating a Hillshade involves calculations that assume that the input parameters being used are in linear units, rather than the angular units that we currently have. This same problem would be true if we were to calculate slope, aspect and many other mathematical operations we might want to apply to this elevation data. What is necessary is that we “Project” our dataset from the current Latitude & Longitude Coordinates, which

Page 28: Downloading from the Seamless Data Distribution …€¦ · Web viewFlowerdew, R., and M. Green. "Developments in Areal Interpolation Methods and GIS." The Annals of Regional Science

locate features on the face of the oblate spheroid that is the earth, to a projection that records our data in linear measurements, as if the earth were flat.

14. Return to ArcToolbox and Search for “Project Raster.” Open the tool and select your ned_####### layer as the Input raster, Browse to your \Data\Raster folder and save the Output raster as ned_proj. For the Output Coordinate

System, Click the Properties Icon to open the Spatial Reference Properties

Dialog Box. Click on the Select… Button and Browse to Projected Coordinate Systems>State Plane>Nad 83> NAD 1983 StatePlane Connecticut FIPS 0600.prj. Click Add. Click OK.

15. ArcToolbox adds the new layer to our Map Document. Right-Click on the new ned_proj layer and Open the Properties. Select the Source Tab and inspect the changes to the Cellsize and Linear Units items.

16. Use the Hillshade tool again, using the new projected elevation layer, to produce a new Hillshade Layer, called hillshade_02. Be sure to set the Z Factor to 3, like before.

17. You should find that you now have a much more pleasant looking result from the Hillshade Tool.

Page 29: Downloading from the Seamless Data Distribution …€¦ · Web viewFlowerdew, R., and M. Green. "Developments in Areal Interpolation Methods and GIS." The Annals of Regional Science

Converting from the Interchange (.e00) File Format to Shapefile.

Interchange (.e00) format is a legacy data format from the days of Arc/INFO, when coverages and grids were the default data type for GIS modeling. Interchange files were a way of ‘packaging’ coverages and grids, whose essential data were distributed across more than one folder. While most GIS data is now being produced in, or has been converted to, shapefile, you will still encounter Interchange format files. ArcCatalog retains the tools necessary to convert from Interchange format.

1. Open ArcCatalog and go to View>Toolbars to enable the “ArcView 8x Tools” toolbar. You should see a toolbar appear with a single dropdown button labeled

“Conversion Tools.”

2. Launch the “Import from Interchange File” tool from the Conversion Tools.

3. Browse to the \Data\Other folder and select the newhav.e00 file as your input file.

4. For the Output File, browse to the \Data\Shapefile folder and name the output file nh_wetlands. Click Save.

5. Before you apply the conversion, Click the Batch Button , and note that this tool provides the ability to convert multiple files at once. Click OK to apply the conversion.

Page 30: Downloading from the Seamless Data Distribution …€¦ · Web viewFlowerdew, R., and M. Green. "Developments in Areal Interpolation Methods and GIS." The Annals of Regional Science

6. In ArcCatalog, Browse to the \Data\Shapefile folder, and find the nh_wetlands coverage file. In the Catalog Tree window (on the left) expand the nh_wetlands coverage layer so that the four component layers are visible. We only want the polygon layer.

7. Right-click on the polygon component of the nh_wetlands coverage and select Export>To Shapefile (Single). Give the \Data\Shapefile folder as the Output Location, and name the Output Feature Class as New_Haven_NWI_Wetlands. Leave the remaining Options as their default values. Click OK to apply the conversion.

8. Once the conversion is complete, you should see the New_Haven_NWI_Wetlands.shp appear in your \Data\Shapefiles folder.