Information Retrieval - Question paper 2074
Attempt any ten questions.
1. How is IR in web search different from other IR systems? Discuss IR architecture with a suitable example.
2+4 Marks
2. Assume that the document space is defined by five terms: Network, CSIT, Nepal, TU, and Graduate. We have three documents containing the following terms:
Doc1: CSIT Nepal
Doc2: TU CSIT
Doc3: CSIT TU Nepal
If the query is "CSIT NEPAL", find the top 2 documents retrieved by the Boolean space model.
6 Marks
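The matching step in Question 2 can be sketched as follows. This is only one reading of the question: it assumes the query terms are combined with AND, that matching is case-insensitive, and that the function name `boolean_and_match` is a hypothetical helper introduced here for illustration.

```python
# Term sets for the three documents in Question 2 (lowercased for matching).
docs = {
    "Doc1": {"csit", "nepal"},
    "Doc2": {"tu", "csit"},
    "Doc3": {"csit", "tu", "nepal"},
}
query = "CSIT NEPAL"


def boolean_and_match(query, docs):
    """Return IDs of documents that contain every query term (Boolean AND)."""
    terms = set(query.lower().split())
    # A document matches when the query terms are a subset of its terms.
    return [doc_id for doc_id, doc_terms in docs.items() if terms <= doc_terms]


matches = boolean_and_match(query, docs)
print(matches)  # ['Doc1', 'Doc3']
```

Under this assumption Doc2 is excluded because it lacks "Nepal". A pure Boolean model does not rank its matches, so the order of the two retrieved documents is simply the order in which they are listed.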
3. What is meant by stop word removal? Explain text normalization with suitable example.
1+5 Marks
4. Suppose the table given below lists all the documents retrieved by an algorithm. If the total number of relevant documents is 6, calculate the values of recall, precision, and F-score.
S.N. | Doc ID | Relevant |
1 | D1 | no |
2 | D2 | no |
3 | D3 | yes |
4 | D4 | no |
5 | D5 | yes |
6 | D6 | yes |
7 | D7 | no |
8 | D8 | no |
9 | D9 | yes |
6 Marks
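A minimal sketch of the arithmetic for Question 4, using the relevance column from the table and the stated total of 6 relevant documents. It assumes "F-score" means the balanced F1 measure (the harmonic mean of precision and recall); the variable names are illustrative.

```python
# Relevance judgments for the nine retrieved documents, in table order.
retrieved = [
    ("D1", False), ("D2", False), ("D3", True),
    ("D4", False), ("D5", True), ("D6", True),
    ("D7", False), ("D8", False), ("D9", True),
]
total_relevant = 6  # given in the question

# Number of retrieved documents that are relevant.
relevant_retrieved = sum(1 for _, is_relevant in retrieved if is_relevant)

precision = relevant_retrieved / len(retrieved)  # 4 / 9
recall = relevant_retrieved / total_relevant     # 4 / 6

# Assumption: F-score is the balanced F1 measure.
f1 = 2 * precision * recall / (precision + recall)

print(f"precision = {precision:.3f}")  # precision = 0.444
print(f"recall    = {recall:.3f}")     # recall    = 0.667
print(f"F1        = {f1:.3f}")         # F1        = 0.533
```

If a weighted F-measure is intended instead, the last formula would change, while precision and recall stay the same.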
5. Why is query expansion important? Discuss query expansion techniques with examples.
1+5 Marks
7. How are bots different from spiders? Describe the simple and multithreaded spidering algorithms.
1+5 Marks
8. How is text categorization different from clustering? Explain the nearest neighbor categorization algorithm.
1+5 Marks
9. Differentiate collaborative filtering from content-based filtering. Discuss the content-based recommender system with its strengths and drawbacks.
2+4 Marks
10. Why is TF-IDF weighting important in information retrieval? Explain with a suitable example.
6 Marks
11. How does information extraction differ from information retrieval? Discuss the role of XML in information extraction.
6 Marks