A Case Study in Web Search Using TREC Algorithms - http://www10.org/cdrom/papers/317/
This study evaluates the performance of a state-of-the-art keyword-based document ranking algorithm (coming out of TREC) on a popular web search task. |
Document Ranking and the Vector-Space Model. - http://www.cs.ust.hk/~dlee/Papers/ir/ieee-sw-rank.ps.gz
It describes key issues in document ranking techniques based on the vector space model. Several TF*IDF variants are discussed. The cosine measure, recall and precision are introduced. [PS format] |
Exploring the Similarity Space - http://goanna.cs.rmit.edu.au/~jz/fulltext/sigirforum98.pdf
Evaluation of many combinations of term frequency statistics, document frequency statistics and document length normalization. |
Information Retrieval Tutorial - http://isp.imm.dtu.dk/thor/projects/multimedia/textmining/
Description of boolean retrieval, vector space model, probabilistic retrieval, latent semantic indexing and other IR topics. An introduction to various classical ranking methods is also provided. |
Latent Semantic Indexing: a Probabilistic Analysis - http://www.cs.berkeley.edu/~christos/ir.ps
Formal introduction to latent semantic indexing. [PS format] |