focusing on the individual: find, build, suggest

25
Focusing on the Individual: Find, Build, Suggest Anurag Acharya

Upload: scielo-scientific-electronic-library-online

Post on 24-Apr-2015

506 views

Category:

Education


2 download

DESCRIPTION

Grande parte da comunicação científica é centrada no indivíduo. Como pesquisadores, exploramos outros trabalhos de autores de artigos interessantes; como editores, selecionamos revisores especialistas em áreas particulares; como financiadores, buscamos pesquisadores com currículo comprovado. Apresentarei aspectos do Google Acadêmico com foco no pesquisador individual. Descreverei nossa abordagem para a desambiguação de autores, permitindo mesmo a autores prolíficos criar rapidamente um perfil com todos os seus artigos. Apresentarei a integração dos perfis de autores individuais nos resultados de busca do Google Acadêmico, que facilita a exploração de outros trabalhos de autores-chave pelos usuários. Por fim, descreverei nossas recomendações personalizadas, que analisam publicações de um autor para identificar automaticamente artigos recentemente publicados, que possam ser de seu interesse. Much of scholarly communication is centered around individuals. As researchers, we explore other work by authors of interesting articles, as editors, we seek out reviewers knowledgeable in particular areas, as funders, we look for researchers with a proven track record. I will present features in Google Scholar that focus on the individual researcher. I will describe our approach to author disambiguation which allows even prolific authors to quickly set up a profile with all their articles. I will present integration of individual author profiles into Scholar search results which make it easy for users to explore other work by key authors. Finally, I will describe our personalized recommendations which analyze an author’s publications to automatically identify recently published articles that are likely to be of her interest. Gran parte de la comunicación académica se centra en los individuos. Como investigadores, exploramos trabajos de otros autores de artículos interesantes, como editores, buscamos revisores con conocimientos en áreas específicas, como financiadores, buscamos investigadores con una trayectoria comprobada. Voy a presentar características en Google Académico que se centran en el investigador individual. Voy a describir nuestro enfoque para la desambiguación de autores que permite que incluso los autores prolíficos configuren rápidamente un perfil con todos sus artículos. Voy a presentar la integración de perfiles de autor individuales en los resultados de Google Scholar, que hacen que sea fácil para los usuarios explorar otros trabajos de autores clave. Por último, voy a describir nuestras recomendaciones personalizadas que analizan las publicaciones de un autor para identificar automáticamente los artículos recientemente publicados que pueden ser de su interés.

TRANSCRIPT

Page 1: Focusing on the Individual: Find, Build, Suggest

Focusing on the Individual:

Find, Build, Suggest

Anurag Acharya

Page 2: Focusing on the Individual: Find, Build, Suggest

Science is anchored by scientists

Keeping up with significant advances

– We look for recent work by key authors

Evaluating quality of new research

– We identify & invite experts to review

Funding impactful research

– We look for investigators with a proven record

Page 3: Focusing on the Individual: Find, Build, Suggest

Scholars on Google Scholar

Challenges

– Identify publications by an author

– Keep them up-to-date

– Pivot from articles to authors and back

Opportunities

– Find me what I need to read

– Identify key researchers in areas

Page 4: Focusing on the Individual: Find, Build, Suggest

Overview

Author disambiguation

– Quickly setup author profile, auto-maintain

Integration with Scholar search

– Work by key authors

– Find experts in an area

Personalized recommendations

– Find me what I need to read

Page 5: Focusing on the Individual: Find, Build, Suggest

Author disambiguation approach

Build a statistical model grouping articles by an author with the same name

– Author lists, journals, co-authors, research area, affiliations, text of articles

– Multi-dimensional model

– Shoot for high precision, very good recall

Page 6: Focusing on the Individual: Find, Build, Suggest

Disambiguation approach – II

Many authors work in multiple areas

– With multiple co-authors, multiple communities

– Trying to group these results will break others

Allow such groups to remain separate

– Make it trivially easy for authors to merge

Final disambiguation step is human

– Key is to make the human step very simple

Page 7: Focusing on the Individual: Find, Build, Suggest

How does it work? Present groups of articles matching name

– Author selects groups written by her

After setup, updates are automated

Author can merge/add/remove

Changes fed back into the statistical model

– Improves update precision & recall

– Automated => must be hi-precision/hi-recall

Page 8: Focusing on the Individual: Find, Build, Suggest
Page 9: Focusing on the Individual: Find, Build, Suggest

What do you get?

List of all your publications

Citation metrics – overall and per article

Links to co-authors

Follow all your citations

Colleagues can follow your work

Personalized recommendations

Page 10: Focusing on the Individual: Find, Build, Suggest
Page 11: Focusing on the Individual: Find, Build, Suggest
Page 12: Focusing on the Individual: Find, Build, Suggest

How well does it work?

Worldwide adoption

– Widely published and cited authors

– Most authors take a few minutes

– Most authors opt for automated updates

– All countries, all areas

Page 13: Focusing on the Individual: Find, Build, Suggest
Page 14: Focusing on the Individual: Find, Build, Suggest
Page 15: Focusing on the Individual: Find, Build, Suggest
Page 16: Focusing on the Individual: Find, Build, Suggest

Why does it work? Statistical model is quite effective

– Able to achieve high precision

– Recall tradeoff is small

– Flexibility to split groups key, but not frequent

For most authors takes 5-10 minutes

– Effort is one-time, updates are automated

Enables many useful services – well worth the few minutes!

Page 17: Focusing on the Individual: Find, Build, Suggest

Finding Scholars

Queries including name

– Add matching author profiles as results

Keyword queries

– Link author names to author profiles

Browse researchers in an area

– Link author interests to search over profiles

Page 18: Focusing on the Individual: Find, Build, Suggest
Page 19: Focusing on the Individual: Find, Build, Suggest
Page 20: Focusing on the Individual: Find, Build, Suggest
Page 21: Focusing on the Individual: Find, Build, Suggest

Personalized recommendations

Challenge: keep up with rapid growth in articles

Approach:

– Centered around author profiles

– Analyze research interests & evolution

– Factor in co-authors & their evolution

– Leverage citation graph

– Scan all newly seen articles

Page 22: Focusing on the Individual: Find, Build, Suggest
Page 23: Focusing on the Individual: Find, Build, Suggest
Page 24: Focusing on the Individual: Find, Build, Suggest
Page 25: Focusing on the Individual: Find, Build, Suggest

Finally… Individuals key for recovering structure

– Allow efficient, cascade-able disambiguation

– Depts, institutions, funding agencies

– Most other analyses can be layered on top

Find what I need to read – classic hard problem

– Much progress, much much more to do