arxiv_cs ([info]arxiv_cs) wrote,
@ 2008-06-14 10:22:00
Previous Entry  Add to memories!  Tell a Friend!  Next Entry
Clustering of scientific citations in Wikipedia. (arXiv:0805.1154v2 [cs.DL] UPDATED)

Clustering of scientific citations in Wikipedia. (arXiv:0805.1154v2 [cs.DL] UPDATED)

The instances of templates in Wikipedia form an interesting data set of structured information. Here I focus on the cite journal template that is primarily used for citation to articles in scientific journals. These citations can be extracted and analyzed: Non-negative matrix factorization is performed on a (article x journal) matrix resulting in a soft clustering of Wikipedia articles and scientific journals, each cluster more or less representing a scientific topic.


read more at cs updates on arXiv.org



Create an Account
Forgot your login or password?
Login w/ OpenID
English • Español • Deutsch • Русский…