introduction to the data quality suite an affordable, effective address management and data quality...
TRANSCRIPT
Introduction to the Data Quality Suite
An affordable, effective address management and data quality solution
Data Quality Suite
• Address Object– Parse Interface– Street Data Interface– Zip Data Interface
• Email Object =• Phone Object• Name Object
Other Data Quality Products
• GeoCoder Object - Find the lat/long for an address• Presort Object - Automate mail operations• MatchUp API - Merge/purge and dedupe lists• RightFielder 3 API - Turn messy text data into fielded data• SmartMover - Realtime NCOALink Web service
Multiplatform Capability
• Data Quality Suite is cross platform and integrates easily.
Address Object
• Verify and correct any U.S. or Canadian address
• Add ZIP+4, carrier route, delivery point, and more…
• Standardize to USPS (CASS Certified) & Canada Post (SERP) specifications
• Designed in cooperation with the mailing industry to improve the accuracy of postal automation
• Provides a common measure by which to test the quality of address matching software
• Addresses known deficiencies common across software products that require correction
Coding Accuracy Support System
• CASS Certification requires annual testing: – ZIP + 4® coding (minimum passing score is 98%)– Delivery point coding (minimum passing score is 100%)– Carrier route coding (minimum passing score is 98%)– Five-digit coding (minimum passing score is 98%)
Coding Accuracy Support System
Address Object at Work
Before
22382 emprisa, ste 100 92688
AfterCompany: Melissa DataAddress: 22382 Avenida EmpresaSuite: 100City: Rancho Santa MargaritaState: CAZIP: 92688+4: 2112Carrier Rt: C056Delivery Pt: 821County FIPS: 06059County Name: OrangeTime Zone: PacificPMSA: 5945Congress Dist: 48Results: AS01, AS12 (Verified)
Address Object Addons
• Delivery Point Validation• LACSLink• Residential and Business Delivery Indicator
(RBDI)• Early Warning System (EWS)• SuiteLink• Address Plus
DPV (Delivery Point Validation)
• Verify an address actually exists – right down to the suite or apartment number
Melissa Data
22382 Avenida Empresa
Rancho Santa Margarita, CA, 92688-2112
Status Code = AS01, AS12 (DPV Level)
Primary address and suite verified
LACSLink
• Updated “street” addresses, as converted by the 911 system
Rural Box Addressing
Street Style Addressing
PO Box Renumbering
Old Address RR 1 Box 127A 237 Park Ave PO BOX 123
New Address 17255 Scenic Rd 466 Lexington Ave PO BOX 555123
LACSLink Address Conversion Examples
RBDI
Before
22382 avenida empresa, 92688
With RBDI add-on
Delivery Indicator: B (business)
• Addresses are flagged as either a Residence or a Business.
SuiteLink
• USPS product – introduced in 2009
• Improves business addressing by appending known secondary (suite) numbers based on the company information provided.
SuiteLink At Work
Company: C ROY CONST
Address: 3401 E CHAPMAN
City: ORANGE
State: CA
ZIP: 92869
Company: C ROY CONST
Address: 3401 E CHAPMAN
Suite: Ste. A
City: ORANGE
State: CA
ZIP: 92689
Input Output - using SuiteLink
Address Plus
• Add-on to Address Object
• Appends BOTH residential and business suites
• Uses the Last Name or Company Name input
• Compiled from multiple business and residential sources
Address Plus At Work
Last Name: KAY
Company: DR SUSANA KAY
Address: 7630 E CHAPMAN
City: ORANGE
State: CA
ZIP: 92869
Last Name: KAY
Company: DR SUSANA KAY
Address: 7630 E CHAPMAN
Suite: Ste. B
City: ORANGE
State: CA
ZIP: 92689
Input Output - using Address Plus
EWS(Early Warning System)
• Warns when addresses are missing from the USPS database and will be updated in the next release
• Usually signifies new developments
• Weekly updates
EWS at Work
Next release
10601 Rhodenda Ave
Marlboro, MD, 20772-4713
Error Code = AE06 (warning)
Current release
10601 Rhodenda Ave
Marlboro, MD, 20772-4713
Status Code = AS01, AS12 (DPV validated)
Address Verification
Good vs Bad Address
Good Address
- 22382 Avenida Empresa, Rancho Santa Margarita, CA 92688
Results: AS01
Bad Address
- 22382 Fake St., Rancho Santa Margarita, CA 92688
Results: AE02
Fuzzy Matching
Correcting and Standardizing imperfect addresses.
- The address object will try to correct and verify imperfect addresses using complex matching algorithms.
- A verified address means that an actual, physical and deliverable mailbox exists. We do not make any guesses.
Fuzzy Matching
31 W MacArthur 31 W MacArthur Cres Apt E425, Cres Apt E425, Santa Ana, CA Santa Ana, CA 9270792707
31 E MacArthur 31 E MacArthur Cres Apt E425, Cres Apt E425, Santa Ana, CA Santa Ana, CA 9270792707
Missing/Incorrect Directionals
Fuzzy Matching
31 E MacArthur 31 E MacArthur Apt E425, Santa Apt E425, Santa Ana, CA 92707Ana, CA 92707
31 E MacArthur 31 E MacArthur Cres Apt E425, Cres Apt E425, Santa Ana, CA Santa Ana, CA 9270792707
Missing Suffixes
Fuzzy Matching
31 E MacArteur 31 E MacArteur Cres E425, Cres E425, Santa Ana, CA Santa Ana, CA 9270792707 31 E MacArthur 31 E MacArthur
Cres Apt E425, Cres Apt E425, Santa Ana, CA Santa Ana, CA 9270792707
Misspellings (Street and even City Names)
31 E MacArthur 31 E MacArthur Cres Apt E425, Cres Apt E425, Sana An, CASana An, CA
Bad Addresses
Interpreting Results
- The Address Object’s Results Property will not only tell you if an input address was good or bad, it also specifies what aspect of the address was incorrect.
Bad Addresses
100 Unknown St, 100 Unknown St, Santa Ana, CA Santa Ana, CA 9270792707
Results: AE02Results: AE02
- Which means - Which means ““Unknown Unknown StreetStreet””
Example: Unknown Street
Bad Addresses
Other Errors• AE01 Zip Code Error • AE02 Unknown Street • AE03 Component Mismatch Error • AE04 Non-Deliverable Address Error• AE05 Multiple Match Error • AE06 Early Warning System Error • AE07 Missing Minimum Address Input Error• AE08 Suite Range Invalid Error • AE09 Suite Range Missing Error • AE10 Primary Range Invalid Error • AE11 Primary Range Missing Error
Address Object Interfaces
The Interfaces
Address Check
Parse Street Data Zip Data
Street Data Interface
• Allows you to perform queries on the Street Data based on the street name and the zip code.
• Used for generating alternate street suggestions for front end solutions.
Generating Suggestions
• Multiple Suffixes– 1014 Laurel, Beverly Hills CA 90210
• 1014 Laurel Way, Beverly Hills CA 90210• 1014 Laurel Lane, Beverly Hills CA 90210
• Multiple Pre/Post direction– 1613 Gastel Circle, Mission TX 78572
• 1613 E Gastel Circle, Mission TX 78572• 1613 W Gastel Circle, Mission TX 78572
Generating Suggestions
• Unknown Street Due to Misspelling• 13530 Cobblst Ln, Westminster CA 92683 Unknown Street
– Find Street: “13530 Cobblst*, 92683”• Wild Card Search (*): Allows you to search for street
names that begin with the specified string.
Generating Suggestions
Upcoming Feature
- NOTE: The FindSuggestions functionality is set for release. This feature automatically generates street suggestions for you.
Zip Data Interface
• Allows you to do the following queries:– Zip in City
• Return zip code Information based off the city.
– City in Zip• Return city information based off the zip.
– City in State• Return cities in a given state.
Parse Interface
• Parse out individual address components
22382 Avenida 22382 Avenida Empresa, Empresa, 9268892688
• Range = 22382
• StreetName = Empresa
• Suffix = Avda
Other Data Quality Services
GeoCoder Object with GeoPoints
– GeoCoder will retrieve Lat/Long information at the Zip+4 level.– GeoPoints will retrieve Lat/Long information at the Rooftop
level based off the zip, plus4 and Delivery Point Code (DPC).– Also returns other GeoData elements such as the CBSA,
County Information and Census.
GeoObject at Work
92688-2112-82
Input Output
Census Block: 1002County Name: OrangeLatitude: 33.637519Longitude: -117.606924CBSA Code: 31100CBSA Division: Los Angeles-Long Beach-
Santa Ana, CAPlace Name: Rancho Santa MargaritaResults: GS05 (Information at the
rooftop level)
Email Object
• Validate, correct and standardize email addresses– Check and correct general format syntax errors– Check for and correct top level domain name (TLD)– Detect improper email address misspellings for common
domains (ie:Hotmail, AOL, Yahoo)– Verify whether the domain name exists through the
MaileXchange (MX) Lookup for realtime feedback– Parse email addresses into various components
Email Object at Work
Before
Bud@@melissadata.con
After
TLD: comDomain: melissadataTLD Desc: Operated by VerisignEmail: [email protected]: V (valid)
• Syntax was corrected• Top Level Domain was corrected
Phone Object
• 10 Digit level of verification: Data Source from Telco Data (411 Data).
• Update area codes that have split
• Parse out the phone number to individual components
• Determine whether a phone number is a Business, Residential or Home Office, as well as if it was a Cell, Landline or VOIP.
Phone Object at Work
714 589-5200 x113
Before After
Area Code: 714Prefix: 589Suffix: 5200Extension: 113New Area Code: 949City: Garden GroveState: CACounty FIPS: 06059County: OrangeLatitude: 33.772800Longitude: -117.937897
Name Object
• Parse a single or dual names into its individual components
• Identify gender based on first name
• Catch vulgar words
• Add personalized salutations
Name Object at Work
dr john wayne brown phd
BeforePrefix: DrFirst Name: JohnMiddle Name: WayneLast Name: BrownSuffix: PhDGender: M
After
Database Updates
National Data Files Monthly
DPV Monthly
LACSLink Monthly
RBDI Bi-Monthly
SuiteLink Monthly
SuiteFinder Bi-Monthly
Canada Monthly
GeoCoder Quarterly
Email Bi-Monthly
Name Bi-Monthly
Phone Bi-Monthly
IPLocator Quarterly
WebSmart Services
WebSmart Products
• Address Check– All addons are initialized including Canadian
• Street Search• Zip Search• RBDI• Geo Coder
– Built in Canadian• IPLocator• Email Check• Phone Check• Name Check
Overview
• Uses only Secure Sockets Layer (SSL) connections.
• Our service uses Windows Communication Framework (WCF) which is faster and more stable then previous ASP.NET implementation.
• Available through SOAP, XML or REST protocols.• Web service uses the Local APIs as its driving
engine.