Running head: STRUCTURING A DB/TEXTWORKS DATABASE 1

Structuring A DB/TextWorks Database

Allison Higgins, Juan Jaime, Jacob Kubrin, Natalie Parker and Jen Pengelly

San Jose State University

Structuring A DB/TextWorks Database

In constructing our database, our group looked at 14 terms, all of which are outlined below. We decided that most of our terms would either be described by a flat term list or have no controlled vocabulary associated with them. The two exceptions were genre, described by a hierarchical term list, and artist, which uses an authority file. The result is an effective and workable database that allows users to search efficiently for any of our 20 albums.

The benefits of using simpler vocabularies are exemplified by our 20-album database. In the first assignment, we discussed using a faceted controlled vocabulary to describe our term “genre.” However, in actually crafting the database, we found that a hierarchical list with three levels was better suited to our search engine. For example, if a user conducted a narrow search for the genre “indie,” the albums returned would be shown under the umbrella term “rock” and the sub-classification “alternative.” If the user instead searched on the broader term “rock,” the albums returned would include those in the sub-classifications “alternative,” “classic,” “folk,” and “blues.” The drawback of using the hierarchical list rather than a faceted controlled vocabulary is that our database does not provide the dynamic browsing experience we had originally envisioned, similar to how a customer would search for a place to eat on Yelp or Zagat. However, given the limited number of entries available for each term, a faceted vocabulary was not a practical model for our database: it would have required more granularity without a significant difference in recall.
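The broader and narrower searches described above can be illustrated with a short Python sketch. It is not how DB/TextWorks stores a hierarchy; the dictionary layout and the function names are assumptions made for illustration, and only the genre terms and their nesting come from the paragraph above.

```python
# Hypothetical three-level genre hierarchy built from the terms named above.
# Keys are parent terms; values are their direct child terms.
GENRE_CHILDREN = {
    "rock": ["alternative", "classic", "folk", "blues"],
    "alternative": ["indie"],
}


def narrower_terms(term):
    """Return the term plus every term beneath it in the hierarchy."""
    found = [term]
    for child in GENRE_CHILDREN.get(term, []):
        found.extend(narrower_terms(child))
    return found


def broader_path(term):
    """Return the chain of terms from the top-level parent down to the term."""
    for parent, children in GENRE_CHILDREN.items():
        if term in children:
            return broader_path(parent) + [term]
    return [term]


def search(albums, term):
    """Return titles of albums whose genre is the term or any narrower term."""
    wanted = set(narrower_terms(term))
    return [title for title, genre in albums if genre in wanted]
```

With a list of (title, genre) pairs, a search on “rock” would also return albums filed under “alternative” or “indie,” while `broader_path("indie")` gives the classification under which an “indie” album would be displayed.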

For our term “artist,” we continued with our idea of an authority file so that a search could return a preferred term even when an alternate term was entered, for instance a birth name or nickname for a single artist, or an individual artist within a group. There are only two levels in this list: preferred term and other terms.
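A two-level authority file of this kind can be sketched in Python as a mapping from each preferred term to its other terms. The variant names below are illustrative examples only and are not taken from our Artist Thesaurus; the variable and function names are likewise assumptions.

```python
# Illustrative authority file: each preferred term maps to its "other terms".
# These variant names are examples only, not the entries in the actual thesaurus.
AUTHORITY_FILE = {
    "Bob Dylan": ["Robert Zimmerman"],      # birth name
    "Lady Gaga": ["Stefani Germanotta"],    # birth name
    "Sublime": ["Bradley Nowell"],          # individual artist within a group
}

# Invert the file so any name, preferred or not, resolves to the preferred term.
LOOKUP = {}
for preferred, others in AUTHORITY_FILE.items():
    LOOKUP[preferred.lower()] = preferred
    for other in others:
        LOOKUP[other.lower()] = preferred


def preferred_artist(query):
    """Return the preferred artist name for a query, or None if it is unknown."""
    return LOOKUP.get(query.strip().lower())
```

Entering either the preferred term or one of its other terms leads to the same preferred term, which is the behavior the authority file is meant to provide.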

It should also be noted that in building our database, we entered stopwords as they appeared in an artist’s name, album title, or song title; however, the database ignores those words when searching. A complete list of the stopwords the program ignores appears under “Structuring the DB/TextWorks Database.” The terms we identified and their definitions are:

1. Team Member: indicates the name of the team member who entered the record. The controlled vocabulary is a flat term list, as there is no overlap between terms. In this case the entries were limited to our five team members (Allison, Jake, Jen, Juan, and Natalie).

2. Artist: describes the name of the artist or group, including stopwords as they appear in the name (e.g., The National). The controlled vocabulary used to describe this term is an authority file. The entry in the database is the preferred term.

3. Other Artist Names: lists other possible names for an artist, including but not limited to original names, stage monikers, and spin-off artists, with stopwords entered as they appear. Each entry is a separate record in the thesaurus tool of the database. This term is related to the “Artist” term: “Other Artist Names” represents the “other terms” outlined under the rules of an authority file (Controlled vocabulary/terminology concepts, 2004). A search for an “other term” recalls the associated preferred term.

4. Album Title: entered with stopwords as they appear. We do not use a controlled vocabulary to describe this term because we anticipate adding more albums to our database in the future.

5. Genre: uses a hierarchical term list to define the explicit relationship between parent terms and child terms. Not all entries have child terms; where a relationship exists, a search would show it.

6. Format: indicates the album format in stock at our store. Entries were limited to CD, Cassette, MP3 Download, and Vinyl, so this term uses a flat term list with no overlap between the terms.

7. Number of Tracks: describes the number of tracks that appear on the album. There is no need for a controlled vocabulary.

8. Name of Tracks: uses no controlled vocabulary. The name of each track was entered into this single field, with stopwords entered as they appeared and an F7 break between track names. This makes the database searchable by track for the purpose of returning artist or album information.

9. Length of Album: describes the length of the album, presented in minutes:seconds (##:##). For an album over one hour in length, the preferred form is “64:10” rather than “1:04:10.” There is no controlled vocabulary for this term.

10. Popularity: ultimately, we did not include this term in our database. We had wanted to base it on Billboard ratings, but for many of our albums there was no chart information, so we thought it best to exclude the category. We had envisioned this term as a numerical value equal to the highest position the album achieved on Billboard’s album chart, with an n/a value for albums that did not rank.

11. Record Label: indicates the name of the record label responsible for production of the album, rather than for associated artists. We did not use a controlled vocabulary, although we recognized that a hierarchical list would have been possible. We decided against it because ownership of subsidiaries can change, so the information will have a longer shelf life without the relational value.

12. Retail Price: the suggested retail price of the item. We do not use a controlled vocabulary to describe this term.

13. Wholesale Price: the price at which the store purchased the album. We generally assumed a 50-70% markup. Again, we did not employ a controlled vocabulary to describe this term.

14. Sale Price: describes the price at which the store actually sells the album, again without a controlled vocabulary.
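The relationship between wholesale and retail price mentioned in item 13 can be illustrated with a short Python sketch. It assumes that “markup” means the difference between retail and wholesale price expressed as a percentage of the wholesale price, which the definitions above do not state explicitly. The function names and the $10.00 wholesale price are hypothetical.

```python
def markup_percent(wholesale, retail):
    """Markup expressed as a percentage of the wholesale price (an assumption)."""
    return (retail - wholesale) / wholesale * 100


def retail_from_markup(wholesale, markup_pct):
    """Retail price implied by a wholesale price and a markup percentage."""
    return round(wholesale * (1 + markup_pct / 100), 2)


# Hypothetical album bought by the store for $10.00.
low = retail_from_markup(10.00, 50)    # retail price at a 50% markup
high = retail_from_markup(10.00, 70)   # retail price at a 70% markup
print(f"Retail price range: ${low:.2f} to ${high:.2f}")
```

Under this reading, a 50-70% markup on a $10.00 wholesale price corresponds to a retail price between $15.00 and $17.00.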

Structuring the DB/TextWorks Database

Database Structure

Textbase Structure

Textbase Information

Textbase: C:\Users\Natalie\Documents\SLIS\DBTextWorks\LIBR202 Group 2 Music Database\LIBR202 Group 2 Music Database
Created: 10/2/2011 4:20:31 PM
Modified: 10/2/2011 5:09:42 PM

Field Summary:
1. Item Number: Automatic Number(next avail=21, increm=1), Term
2. Artist: Text, Term & Word
   Thesaurus: C:\Users\Natalie\Documents\SLIS\DBTextWorks\LIBR202 Group 2 Music Database\Artist Thesaurus
   Validation: required
3. Album Title: Text, Term & Word
4. Genre: Text, Term & Word
   Thesaurus: C:\Users\Natalie\Documents\SLIS\DBTextWorks\LIBR202 Group 2 Music Database\Genre Hierarchy
   Validation: required, valid-list
5. Format: Text, Term & Word
   Validation: required, valid-list
6. Number of Tracks: Number, Term
   Validation: required
7. Album Length: Number, Term
   Validation: required
8. Track Names: Text, Term & Word
   Validation: required
9. Record Label: Text, Term & Word
   Validation: required
10. Retail Price: Number, Term
    Validation: required
11. Wholesale Price: Number, Term
    Validation: required
12. Sale Price: Number, Term
    Validation: required
13. Team Contributor: Text, Term & Word
    Validation: required, valid-list

Log file enabled, showing 'Item Number'
Leading articles: a an the
Stop words: a an and by for from in of the to
XML Match Fields:
1. Item Number

Textbase Defaults:
Default indexing mode: SHARED IMMEDIATE
Default sort order: <none>
Textbase passwords:
Master password = ''
0 Access passwords:
No Silent password
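The effect of the stop word and leading article settings listed above can be approximated with the following Python sketch. It is a simplified model rather than DB/TextWorks’ actual indexing logic: it splits on whitespace only, and it assumes that leading articles are skipped for sorting purposes. The function names are hypothetical.

```python
# Settings copied from the textbase information above.
STOP_WORDS = {"a", "an", "and", "by", "for", "from", "in", "of", "the", "to"}
LEADING_ARTICLES = {"a", "an", "the"}


def index_words(text):
    """Return the lowercase words that a word index would keep."""
    words = text.lower().split()
    return [word for word in words if word not in STOP_WORDS]


def sort_key(text):
    """Drop one leading article so that 'The National' sorts under 'N'."""
    words = text.split()
    if len(words) > 1 and words[0].lower() in LEADING_ARTICLES:
        words = words[1:]
    return " ".join(words).lower()
```

In this model the stored value keeps its stopwords, as in “The Cream of Clapton,” while only “cream” and “clapton” are searchable as words.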

Validation Lists

Format:
Cassette
CD
MP3 Download
Vinyl

Team Member:
Allison Higgins
Jake Kubrin
Jen Pengelly
Juan Jaime
Natalie Parker

Genre:
Alternative Rock
Blues
Classic Rock
Electronic
Folk Rock
French Cabaret
Hip Hop
Indie Rock
Jazz
Latin Pop
Mariachi
New Age
Pop
Ska
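The “required” and “valid-list” validation rules can be sketched in Python as follows. The valid lists and required fields are copied from the textbase structure above; the `validate` function and the dictionary representation of a record are assumptions made for illustration, not part of DB/TextWorks.

```python
# Valid lists copied from the validation lists above.
VALID_LISTS = {
    "Format": {"Cassette", "CD", "MP3 Download", "Vinyl"},
    "Team Contributor": {
        "Allison Higgins", "Jake Kubrin", "Jen Pengelly",
        "Juan Jaime", "Natalie Parker",
    },
    "Genre": {
        "Alternative Rock", "Blues", "Classic Rock", "Electronic",
        "Folk Rock", "French Cabaret", "Hip Hop", "Indie Rock", "Jazz",
        "Latin Pop", "Mariachi", "New Age", "Pop", "Ska",
    },
}

# Fields marked "Validation: required" in the field summary.
REQUIRED_FIELDS = [
    "Artist", "Genre", "Format", "Number of Tracks", "Album Length",
    "Track Names", "Record Label", "Retail Price", "Wholesale Price",
    "Sale Price", "Team Contributor",
]


def validate(record):
    """Return a list of problems found in a record (empty if it passes)."""
    problems = []
    for field in REQUIRED_FIELDS:
        if not str(record.get(field, "")).strip():
            problems.append(f"{field}: required")
    for field, allowed in VALID_LISTS.items():
        value = record.get(field)
        if value and value not in allowed:
            problems.append(f"{field}: '{value}' not in valid list")
    return problems
```

A record whose Format is not one of the four listed values, or that leaves a required field empty, would be reported rather than accepted.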

Database Records:

Item Number: 1
Artist: Sublime
Album Title: 40 Oz to Freedom
Genre: Ska
Format: MP3 Download
Number of Tracks: 22
Album Length: 69:15
Track Names:
    Waiting for My Ruca
    40 Oz. to Freedom
    Smoke Two Joints
    We're Only Gonna Die for Our Arrogance
    Don't Push
    5446 Thats My Number / Ball and Chain
    Badfish
    Let's Go Get Stoned
    New Thrash
    Scarlet Begonias
    Live at E's
    D.J.'s
    Chica Me Tipo
    Right Back
    What Happened
    New Song
    Ebin
    Date Rape
    Hope
    KRS-One
    Rivers of Babylon
    Thanx
Record Label: Mca
Retail Price: $9.49
Wholesale Price: $7.00
Sale Price: $8.50
Team Contributor: Natalie Parker

Item Number: 2
Artist: Sonic Youth
Album Title: Goo
Genre: Alternative Rock
Format: CD
Number of Tracks: 11
Album Length: 49:23
Track Names:
    Dirty Boots
    Tunic (Song for Karen)
    Mary-Christ
    Kool Thing
    Mote
    My Friend Goo
    Disappearer
    Mildred Pierce
    Cinderella’s Big Score
    Scooter + Jink
    Titanium Exposé
Record Label: Geffen Record
Retail Price: $11.99
Wholesale Price: $7.99
Sale Price: $9.99
Team Contributor: Jake Kubrin

Item Number: 3
Artist: Bob Dylan
Album Title: Highway 61 Revisited
Genre: Folk Rock
Format: CD
Number of Tracks: 9
Album Length: 51:26
Track Names:
    Like a Rolling Stone
    Tombstone Blues
    It Takes a Lot to Laugh, It Takes a Train to Cry
    From a Buick 6
    Ballad of a Thin Man
    Queen Jane Approximately
    Just Like Tom Thumb's Blues
    Desolation Row
Record Label: Columbia Records
Retail Price: $13.99
Wholesale Price: $9.99
Sale Price: $11.99
Team Contributor: Jake Kubrin

Item Number: 4
Artist: Grizzly Bear
Album Title: Yellow House
Genre: Indie Rock
Format: CD
Number of Tracks: 10
Album Length: 50:00
Track Names:
    Easier
    Lullabye
    Knife
    Central and Remote
    Little Brother
    Plans
    Marla
    On a Neck, On a Spit
    Reprise
    Colorado
Record Label: Warp Records
Retail Price: $8.99
Wholesale Price: $4.99
Sale Price: $6.99
Team Contributor: Jake Kubrin

Item Number: 5
Artist: Flying Lotus
Album Title: 1983
Genre: Electronic
Format: CD
Number of Tracks: 11
Album Length: 29:58
Track Names:
    1983
    São Paulo
    Bad Actors
    Orbit Brazil
    Shifty
    Babble
    Pet Monster Shotglass
    Hello
    Untitled #7
    Unexpected Delight
    1983 [Daedalus Odd-Dance Party Remix]
Record Label: Warp Records
Retail Price: $6.99
Wholesale Price: $4.99
Sale Price: $6.99
Team Contributor: Jake Kubrin

Item Number: 6
Artist: Lady Gaga
Album Title: The Fame Monster
Genre: Pop
Format: CD
Number of Tracks: 10
Album Length: 34:11
Track Names:
    Bad Romance
    Alejandro
    Monster
    Speechless
    Dance in the Dark
    Telephone
    So Happy I Could Die
    Teeth
    Bad Romance (Starsmith Remix)
    Digital Booklet - The Fame Monster
Record Label: Interscope
Retail Price: $12.00
Wholesale Price: $4.12
Sale Price: $7.99
Team Contributor: Allison Higgins

Item Number: 7
Artist: Enya
Album Title: And Winter Came...
Genre: New Age
Format: CD
Number of Tracks: 12
Album Length: 44:59
Track Names:
    And Winter Came
    Journey of the Angels
    White is in the Winter Night
    O Come, O Come Emmanuel
    Trains and Winter Rains
    Dreams Are More Precious
    Last Time by Moonlight
    One Toy Soldier
    Stars and Midnight Blue
    The Spirit of Christmas Past
    My! My! Time Flies!
    Oiche Chiuin [Chorale]
Record Label: Warner Music UK Ltd.
Retail Price: $13.54
Wholesale Price: $4.17
Sale Price: $10.99
Team Contributor: Allison Higgins

Item Number: 8
Artist: Foster the People
Album Title: Torches
Genre: Alternative Rock
Format: CD
Number of Tracks: 12
Album Length: 38:28
Track Names:
    Helena Beat
    Pumped Up Kicks
    Call It What You Want
    Don't Stop (Color on the Walls)
    Waste
    I Would Do Anything for You
    Houdini
    Life on the Nickel
    Miss You
    Warrant
Record Label: Sony Music
Retail Price: $9.99
Wholesale Price: $8.70
Sale Price: $9.00
Team Contributor: Allison Higgins

Item Number: 9
Artist: Shakira
Album Title: Laundry Service
Genre: Latin Pop
Format: CD
Number of Tracks: 13
Album Length: 49:16
Track Names:
    Objection (Tango)
    Underneath Your Clothes
    Whenever, Wherever
    Rules
    The One
    Ready for the Good Times
    Fool
    Te Dejo Madrid
    Poem to a Horse
    Que Me Quedes Tu
    Eyes Like Yours (Ojos Asi)
    Suerte (Whenever, Wherever)
    Te Aviso, Te Anuncio (Tango)
Record Label: Sony Music
Retail Price: $9.99
Wholesale Price: $4.20
Sale Price: $8.60
Team Contributor: Allison Higgins

Item Number: 10
Artist: Nujabes
Album Title: Modal Soul
Genre: Hip Hop
Format: CD
Number of Tracks: 14
Album Length: 63:25
Track Names:
    Feather
    Ordinary Joe
    Reflection Eternal
    Luv (Sic) PT3
    Music is Mine
    Eclipse
    Sign
    Thank You
    World's End Rhapsody
    Modal Soul
    Flower
    Sea of Cloud
    Light On the Land
    Horizon
Record Label: Hyde-Out Productions
Retail Price: $19.99
Wholesale Price: $14.99
Sale Price: $17.50
Team Contributor: Juan Jaime

Item Number: 11
Artist: Lupe Fiasco
Album Title: The Cool
Genre: Hip Hop
Format: MP3 Download
Number of Tracks: 20
Album Length: 70:44
Track Names:
    Baba Says Cool for Thought
    Free Chilly
    Go Go Gadget Flow
    The Coolest
    Superstar
    Paris, Tokyo
    Hi-Definition
    Gold Watch
    Hip-Hop Saved My Life
    Intruder Alert
    Streets on Fire
    Little Weapon
    Gotta Eat
    Dumb It Down
    Hello/Goodbye (Uncool)
    The Die
    Put You On Game
    Fighters
    Go Baby
    Blackout
Record Label: Atlantic
Retail Price: $11.99
Wholesale Price: $7.25
Sale Price: $9.99
Team Contributor: Juan Jaime

Item Number: 12
Artist: Fats Domino
Album Title: Fats is Back
Genre: Jazz
Format: CD
Number of Tracks: 11
Album Length: 29:51
Track Names:
    My Old Friends
    I'm Ready
    So Swell When You're Well
    Wait Till It Happens to You
    I Know
    Lady Madonna
    Honest Papas Love Their Mamas Better
    Make Me Belong to You
    One for the Highway
    Love Rita
    One More Song for You
Record Label: Bullseye Blues
Retail Price: $9.99
Wholesale Price: $5.99
Sale Price: $7.75
Team Contributor: Juan Jaime

Item Number: 13
Artist: Eric Clapton
Album Title: The Cream of Clapton
Genre: Blues
Format: CD
Number of Tracks: 19
Album Length: 79:51
Track Names:
    I Feel Free
    Sunshine of Your Love
    White Room
    Crossroads
    Badge
    Blind Faith
    Blues Power
    After Midnight
    Let it Rain
    Bell Bottom Blues
    Layla
    I Shot the Sheriff
    Let it Grow
    Knockin' on Heaven's Door
    Hello Old Friend
    Cocaine
    Wonderful Tonight
    Promises
    I Can't Stand It
Record Label: Polydor
Retail Price: $11.99
Wholesale Price: $7.99
Sale Price: $9.50
Team Contributor: Juan Jaime

Item Number: 14
Artist: Miles Davis
Album Title: Milestones
Genre: Jazz
Format: Vinyl
Number of Tracks: 6
Album Length: 47:36
Track Names:
    Dr. Jackle (with Jackie McLean)
    Sid's Ahead (Miles Davis)
    Two Bass Hit (with John Lewis - Dizzy Gillespie)
    Milestones
    Billy Boy
    Straight, No Chaser (with Thelonious Monk)
Record Label: Columbia Records
Retail Price: $19.99
Wholesale Price: $7.99
Sale Price: $19.99
Team Contributor: Jen Pengelly

Item Number: 15
Artist: Fleetwood Mac
Album Title: Rumors
Genre: Classic Rock
Format: Vinyl
Number of Tracks: 11
Album Length: 39:03
Track Names:
    Second Hand News
    Dreams
    Never Going Back Again
    Don't Stop
    Go Your Own Way
    Songbird
    The Chain
    You Make Loving Fun
    I Don't Want to Know
    Oh Daddy
    Gold Dust Woman
Record Label: Warner Bros.
Retail Price: $24.99
Wholesale Price: $9.99
Sale Price: $24.99
Team Contributor: Jen Pengelly

Item Number: 16
Artist: Edith Piaf
Album Title: Non Je Ne Regrette Rien
Genre: French Cabaret
Format: CD
Number of Tracks: 40
Album Length: 96:47
Track Names:
    Mon Manege A Moi
    Le Droit d'Aimer
    Non je ne regrette rien
    Quoi Ca Sert L'amour
    Milord
    Comme Moi
    C'est A Hambourg
    Heureuse
    Y'avait Du Soleil
    Le Petit Monsieur Triste
    Les Deux Copains
    Sue une colline
    Y'en A Un De Trop
    Embrasse-Moi
    Jimmy C'est Lui
    J'Ai Danse Avec l'Amour
    Ou Sont-Ils Mes Petits Copains?
    C'etait Un Jour De Fete
    L'Homme des Bars
    Simple Comme Bonjour
    La Vie En Rose
    L'Hymne a l'Amour
    Plus Bleu que Tes Yeux
    L'Homme a La Moto
    La Goualante du Pauvre Jean
    Jezebel
    Padam Padam
    Johnny Tu N'es Pas Un Ange
    Les Amants d'Un Jour
    Les Croix
    Bravo Pour Le Clown
    Je Hais Les Dimanches
    Sous le ciel de Paris
    Ca Ira
    L'Accordeoniste
    Elle Frequentait La Rue Pigalle
    C'est de La Faute a Tes Yeux
    N'Y Vas Pas Manuel
    Mon Legionnaire
    Les Trois Cloches
Record Label: 101 Distribution
Retail Price: $15.99
Wholesale Price: $9.59
Sale Price: $15.79
Team Contributor: Jen Pengelly

Item Number: 17
Artist: The National
Album Title: High Violet
Genre: Indie Rock
Format: CD
Number of Tracks: 11
Album Length: 47:40
Track Names:
    Terrible Love
    Sorrow
    Anyone's Ghost
    Little Faith
    Afraid of Everyone
    Bloodbuzz Ohio
    Lemonworld
    Runaway
    Conversation 16
    England
    Vanderlyle Crybaby Geeks
Record Label: 4ad Records
Retail Price: $15.99
Wholesale Price: $6.39
Sale Price: $11.99
Team Contributor: Jen Pengelly

Item Number: 18
Artist: Mariachi El Bronx
Album Title: II
Genre: Mariachi
Format: CD
Number of Tracks: 12
Album Length: 44:59
Track Names:
    48 Roses
    Great Provider
    Revolution Girls
    Fallen
    Norteno Lights
    Mariachi El Bronx (with Mariachi Reyna De Los Angeles)
    Map of the World
    Bodies of Christ
    Poverty's King
    Matador
    Everything Dies
    Spread Thin
Record Label: ATO Records
Retail Price: $11.42
Wholesale Price: $8.00
Sale Price: $11.00
Team Contributor: Natalie Parker

Item Number: 19
Artist: Johnny Cash
Album Title: At Folsom Prison
Genre: Folk Rock
Format: Cassette
Number of Tracks: 19
Album Length: 55:56
Track Names:
    Folsom Prison Blues
    Busted
    Dark as a Dungeon
    I Still Miss Someone
    Cocaine Blues
    25 Minutes to Go
    Orange Blossom Special
    The Long Black Veil
    Send a Picture of Mother
    The Wall
    Dirty Old Egg-Suckin' Dog
    Flushed from the Bathroom of Your Heart
    Joe Bean
    Jackson (with June Carter)
    Give My Love to Rose (with June Carter)
    I Got Stripes
    The Legend of John Henry's Hammer (with June Carter)
    Green, Green Grass of Home
    Greystone Chapel
Record Label: Columbia Records
Retail Price: $8.28
Wholesale Price: $5.00
Sale Price: $8.00
Team Contributor: Natalie Parker

Item Number: 20
Artist: Green Day
Album Title: 21st Century Breakdown
Genre: Alternative Rock
Format: CD
Number of Tracks: 18
Album Length: 69:17
Track Names:
    Song of the Century
    21st Century Breakdown
    Know Your Enemy
    ¡Viva la Gloria!
    Before the Lobotomy
    Christians Inferno
    Last Night on Earth
    East Jesus Nowhere
    Peacemaker
    Last of the American Girls
    Murder City
    ¿Viva La Gloria? (Little Girl)
    Restless Heart Syndrome
    Horseshoes and Handgrenades
    The Static Age
    21 Guns
    American Eulogy
    See the Slight
Record Label: Reprise
Retail Price: $9.63
Wholesale Price: $8.00
Sale Price: $8.99
Team Contributor: Natalie Parker

Reflections

Creating our own vocabulary produced more challenges than we originally expected. The topic, developing a music store, seemed relatively easy, but we were surprised by how much thought and time the task required. For instance, we had to go beyond just picking four albums with different genres. That part was easy, and we had a lot of fun making the selections. Putting the store together, however, went far beyond choosing album names and artists. We also had to determine the genre; the retail, wholesale, and sale prices; the popularity; the record label; and the length of each album. This was more difficult because it involved “hidden” work that required research on music sites such as iTunes and Billboard’s Top 100. The research included finding the total running time of each CD and the three prices. Because iTunes and Billboard did not always have the information we needed, we supplemented our base of knowledge with information from Amazon.com, locating the highest, lowest, and middle prices for each album (Amazon music, 1996). We simply typed in the name of each artist and then clicked on the link to the appropriate album (Amazon music, 1996). On several occasions, the total time of a CD had to be calculated by adding up the individual track times on a calculator. We found the prices, the album times, and the popularity the hardest to establish. We were often unable to locate the total time and the popularity on either Billboard or iTunes, and on iTunes we could often find only the sale price of an album (Billboard charts, 2011). On occasion, when iTunes offered a discount on an album, it also stated the retail price, but when only one price was listed, we were unsure whether it was the suggested retail price or the sale price.
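The calculation of a total album time from individual track times, which we performed by hand, can be sketched in Python as follows. The output follows the minutes:seconds rule for the album length term, so an album longer than an hour is not converted to hours. The function names and the three track times are hypothetical.

```python
def to_seconds(length):
    """Convert a 'minutes:seconds' string such as '3:45' to seconds."""
    minutes, seconds = length.split(":")
    return int(minutes) * 60 + int(seconds)


def format_length(total_seconds):
    """Format seconds as minutes:seconds without rolling over into hours."""
    minutes, seconds = divmod(total_seconds, 60)
    return f"{minutes:02d}:{seconds:02d}"


def album_length(track_lengths):
    """Add up individual track times and return the album length."""
    return format_length(sum(to_seconds(t) for t in track_lengths))


# Hypothetical track times, used only to show the arithmetic.
print(album_length(["3:45", "4:20", "2:58"]))  # prints 11:03
print(format_length(3850))                     # prints 64:10, not 1:04:10
```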

Creating a description of our inventory using our own vocabulary was somewhat challenging, but it was also a helpful experience in database construction. There was quite a bit of diversity in genre among our inventory, as it was a random selection from each team member. Not surprisingly, there were some questions about the genre “rock.” Because we were able to create a hierarchical classification scheme in the DB/TextWorks program for the genre descriptors, we could define each term we used more clearly. In turn, our organization became much clearer. Instead of treating terms like “alternative rock” and “indie rock” as equivalent, we were able to show that “indie rock” and “ska” were types of genre beneath the parent term “alternative rock.” This also helped us grow our vocabulary and achieve richer descriptions.

Juggling controlled vocabularies is an extensive process that an information professional must perform in order to make information retrieval easier. The type of controlled vocabulary chosen must suit the terms that make up the database. In our case, the initial flat term list we used for “rock” was too vague and unreliable. Our analysis revealed this problem very early on, because some overlapping terms needed a different type of controlled vocabulary. That is why it was important to consider various types of controlled vocabularies in order to create a useful and rich database.

Using various types of controlled vocabularies is useful in creating a database. In the beginning, however, we were very single-minded about creating a very basic search function using only one or two types of controlled vocabularies. After our group met to draft the project and to create the basis for a genuine search engine, we better understood that we should be very selective in using multiple controlled vocabularies for an operational database. For example, our group used an authority file to describe the term “artist,” which allowed a search to return the preferred term for the correct artist, but we did not use an authority file for the term “format” (Controlled vocabulary/terminology concepts, 2004). A flat term list was better suited to describing format, ensuring that no overlap between terms could occur and dilute our database with undesirable results. This was crucial to the success of our basic database: different controlled vocabularies were used so that each term was described in a suitable way.

The use of multiple controlled vocabularies within the structure of our database allowed user-friendly terms to be entered, making the database more flexible for customers who would search for, and potentially buy, any of our 20 albums. This meant that some of the terms our group selected could be searched through a designated controlled vocabulary to accurately identify the desired album. Furthermore, as a group, we were able to describe our terms through suitable controlled vocabularies that allowed us to work resourcefully within the boundaries of DB/TextWorks. This program offered great potential for the creation of our database, but its simplicity also created barriers (An Introduction to DB/TextWorks, 2010). Thus, the use of controlled vocabularies for some of our terms met our expectations, as we constructed our database to perform effectively and efficiently.

Accountability

Our group’s division of individual responsibilities allowed our project to run smoothly and effectively. We divided the assignment as follows. We met as a group to discuss Part 1 of the assignment and to brainstorm ideas for the term list and rules. We then set up a shared Google Docs spreadsheet with a column for each term in the database, in which each team member was responsible for entering the information for their four chosen albums. Natalie constructed the database in the DB/TextWorks program using the terms we identified in our Google Docs spreadsheet. She also worked with individual group members when there were problems putting our vocabulary ideas into practice. Jen wrote part one of this document, explaining the terms. In section three, Allison answered the first question, Jake answered the second question, and Juan answered the third question. Allison also completed this accountability section. All team members were responsible for reviewing the final product.

REFERENCES

Amazon music. (1996). In Amazon.com. Retrieved from http://www.amazon.com/s/ref=nb_sb_noss?url=search-alias%3Dpopular&field-keywords=&x=18&y=15

An introduction to DB/TextWorks. (2010). Retrieved from http://slisweb.sjsu.edu/courses/202/index.html

Billboard charts. (2011). In Billboard.com. Retrieved from http://www.billboard.com/#/charts

Cash, J. (Performer). At Folsom Prison [Cassette]. Columbia Records.

Clapton, E. (Performer). (1995). The cream of Clapton [CD]. Polydor.

Controlled vocabulary/terminology concepts. (2004, September 17). In Digital library for earth system education. Retrieved from http://www.dlese.org/Metadata/vocabularies/term_expln.php

Cool, the. (n.d.). In Amazon.com. Retrieved from http://www.amazon.com/Lupe-Fiascos-The-Cool-Explicit/dp/B001230T0K/ref=sr_shvl_album_1?ie=UTF8&qid=1317965765&sr=301-1

Davis, M. (Performer). (2009). Milestones [Vinyl]. Columbia Records.

DB/TextWorks 13.0. (Demo Version) [Software]. Available from slisweb.sjsu.edu/courses/restricted/DBTextWorksV13.exe

Dylan, B. (Performer). (1965). Highway 61 revisited [CD]. Columbia Records.

Édith Piaf. (n.d.). In Wikipedia. Retrieved from http://en.wikipedia.org/wiki/%C3%89dith_Piaf

Édith Piaf - Non Je Ne Regrette Rien CD. (n.d.). In CD universe. Retrieved from http://www.cduniverse.com/search/xx/music/pid/6836196/a/Non+Je+Ne+Regrette+Rien.htm

Enya. (Performer). (2008). And winter came… [CD]. Warner Music U.K. Ltd.

Fats Domino. (Performer). (1999). Fats is back [CD]. Bullseye Blues.

Fleetwood Mac. (Group). (1977). Rumours [Vinyl]. Warner Bros.

Flying Lotus. (Performer). (2006). 1983 [CD]. Warp Records.

Foster the People. (Group). (2011). Torches [CD]. Sony Music.

Green Day. (Group). (2009). 21st century breakdown [CD]. Reprise.

Grizzly Bear. (Group). (2006). Yellow house [CD]. Warp Records.

iTunes. (Version 10.4.1). [Software]. Available from http://www.apple.com/itunes/download/

Lady Gaga. (Performer). (2009). The fame monster [CD]. Interscope.

Lupe Fiasco. (Performer). (2007). The cool [CD]. Atlantic.

Mariachi El Bronx. (Group). (2011). II [CD]. ATO Records.

National, the. (Group). (2010). High violet [CD]. 4AD Records.

Non je ne regrette rien. (n.d.). In Amazon.com. Retrieved from http://www.amazon.com/Non-Je-Ne-Regrette-Rien/dp/B0000285SO

Nujabes. (Performer). (2005). Modal soul [CD]. Hyde-Out Productions.

Piaf, E. (Performer). (1996). Non je ne regrette rien [CD]. 101 Distribution.

Shakira. (Performer). (2001). Laundry service [CD]. Sony Music.

Sonic Youth. (Group). (1990). Goo [CD]. Geffen Records.

Sublime. (Group). (1992). 40 oz to freedom [MP3 Download]. MCA.