0% found this document useful (0 votes)

4 views

Information Retrieval

The document outlines a midterm exam for the Advanced Software Engineering course, including questions on term-document incidence matrices, positional indexes, and F1 measures for information retrieval systems. It provides specific tasks for students to complete, such as drawing matrices and calculating performance metrics. Additionally, it includes true/false questions related to information retrieval concepts.

Uploaded by

kikofifo5

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

4 views

Information Retrieval

Uploaded by

kikofifo5

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

You are on page 1/ 3

Faculty of Engineering & Technology ‫ كلية الهندسة والتكنولوجيا‬-IUST

Department: Computer Exam: Mid

Date: 13/04/2024 Semester: 1st Year: 2023/2024
and Informatics Solutions
Engineering
Course No.: 306512 Course Name: Advanced Software Engineering Instructor: Dr.-Ing. Basel Hasan
Student No.: Student Name: Number of Pages: 1

Question #1 (10 Marks)

Consider the following document collection:
- D1 A D C M C A
- D2 B A M
- D3 C A R
- D4 A M C
1. Draw the term-document incidence matrix for this collection. Use the resulted matrix to process
the query (A or C) AND NOT R.
2. Draw the positional index representation for this collection. Explain a way to use this index to
process the query A NEAR/2 M.
2m

D1 D2 D3 D4
A 1 1 1 1
(1111 or 1011) and not(0010) =
D 1 0 0 0 1111 and 1101 = 1101 → d1, d2, d4 2m
C 1 0 1 1
M 1 1 0 1
B 0 1 0 0
R 0 0 1 0

3m
A 1: ( 1,6); 2:(2); 3:(2); 4:(1)
B 2: ( 1);
C 1: ( 3,5); 3:(1); 4: ( 3);
D 1: ( 2);
M 1: ( 4); 2:(3); 4: ( 2);
R 3: ( 3);
A NEAR/2 M 3m
take the postings list for A → 1: ( 1,6); 2:(2); 3:(2); 4:(1)
take the postings list for M → 1: ( 4); 2:(3); 4: ( 2);
merge on equals docIDs → we get the documents 1, 2, 4
compare the positions → D2 and D4 will be retrieved.
Question #2 (10 Marks)
Consider an information need for which there are 7 relevant documents A B
in the collection. Two IR systems (A and B) run on this collection. Their 1 N R
top 10 results are judged for relevance as shown aside. 2 N R
3 R N
1. Compute the F1 measure for each system?
4 R N
2. Based on the resulted F1, which system performs better, system A or
5 R R
system B?
6 N N
3. What is the user model behind F1 measure? 7 N
4. If we want the precision to be 2 times more important than recall, which 8 N
system performs better, system A or system B? 9 R
10 N

1.
For System A:
P = 4/10 = 0.4 R = 4/7 ≈ 0.57
F1 = 2*P*R / (P+R) = 2 * 0.4 * 0.57 / (0.4+ 0.57) ≈ 0.47 1m
For System B:
P = 3/6 = 0.5 R = 3/7 ≈ 0.43
F1 = 2*P*R / (P+R) = 2 * 0.5 * 0.43 / (0.5 + 0.43) ≈ 0.46 1m

→ System A performs better as it has higher F1 score. 1m

2. The user wants to get as much as relevant documents with as less as irrelevant documents. 2m
3.

𝛽 = 1/2
For System A:
Fᵦ = (𝛽2 + 1)*P*R / (𝛽2 ∗P+R) = (0.25+1) * 0.4 * 0.57 / (0.25*0.4+ 0.57) ≈ 0.43 2m
For System B:
Fᵦ = (𝛽2 + 1)*P*R / (𝛽2 ∗P+R) = (0.25+1) * 0.5 * 0.43 / (0.25*0.5 + 0.43) ≈ 0.48 2m
➔ System B performs better as it has higher Fᵦ score. 1m
Question #3 (5 Marks)
Answer with true or false and correct the false statements.
1. The main three IR System components are: documents, query and relevance judgments. False.
… and relevant documents.
2. Terms are the output of tokenization process. False. ..of normalization..
3. Consider N = 500 documents, each with max of 10 words, M = 300 distinct terms among these
documents. The term-document incidence matrix for this collection includes 500 1’s at maximum.
False. 5000 1’s
4. In the BOW concept, re-ordering the words in a document doesn’t destroy its topic. True.
5. Relevance has a value with respect to the information need. True.

Good Luck
Dr.-Ing. Basel Hasan

IR MCQ With Answers
100% (1)
IR MCQ With Answers
23 pages
CS3308 Information Retrieval Quiz
50% (2)
CS3308 Information Retrieval Quiz
63 pages
Practice Question For Information Retrieval Subject
No ratings yet
Practice Question For Information Retrieval Subject
5 pages
CSI 4107 - Winter 2016 - Midterm
0% (1)
CSI 4107 - Winter 2016 - Midterm
10 pages
QP Midsem Regular - Solutions For IR
100% (2)
QP Midsem Regular - Solutions For IR
4 pages
Midterm2006 Sol Csi4107
100% (2)
Midterm2006 Sol Csi4107
9 pages
Solution.: Increase - 3
No ratings yet
Solution.: Increase - 3
5 pages
Quiz&Solution
No ratings yet
Quiz&Solution
2 pages
T1 PDF
No ratings yet
T1 PDF
2 pages
IR END PYQ SOLS
No ratings yet
IR END PYQ SOLS
8 pages
asila-IR
No ratings yet
asila-IR
16 pages
Final Exam [Spring 2020 - V1]
No ratings yet
Final Exam [Spring 2020 - V1]
11 pages
IR QB
No ratings yet
IR QB
8 pages
assignment_1
No ratings yet
assignment_1
12 pages
HW 6
No ratings yet
HW 6
1 page
117DX052018
No ratings yet
117DX052018
2 pages
Bits Pilani, Dubai Campus
No ratings yet
Bits Pilani, Dubai Campus
11 pages
2019 - Final Solution - Spring - IR
No ratings yet
2019 - Final Solution - Spring - IR
10 pages
IR_midsem Question Paper_2024_solutionfull (2)
No ratings yet
IR_midsem Question Paper_2024_solutionfull (2)
7 pages
NLP SEE
No ratings yet
NLP SEE
9 pages
Irt Ans
No ratings yet
Irt Ans
9 pages
NLP SEE
No ratings yet
NLP SEE
27 pages
Assignment 1
No ratings yet
Assignment 1
2 pages
ilovepdf_merged (1) (1)
No ratings yet
ilovepdf_merged (1) (1)
4 pages
Exercises&Solutions
No ratings yet
Exercises&Solutions
3 pages
Information Retrieval
100% (1)
Information Retrieval
11 pages
ISR Question Bank
No ratings yet
ISR Question Bank
19 pages
Theory Assignment
No ratings yet
Theory Assignment
4 pages
c3-paper(1)
No ratings yet
c3-paper(1)
4 pages
Irs Question Papers
No ratings yet
Irs Question Papers
6 pages
c3 Paper
No ratings yet
c3 Paper
3 pages
sample_question
No ratings yet
sample_question
19 pages
Unit 4_ Experimental Evaluation of IR
No ratings yet
Unit 4_ Experimental Evaluation of IR
4 pages
IR Model Question Paper
No ratings yet
IR Model Question Paper
2 pages
L02-IR Models MMN
No ratings yet
L02-IR Models MMN
27 pages
April 2019
No ratings yet
April 2019
2 pages
NLP Mod-V Q - A (Uploaded by Snaptricks - In)
No ratings yet
NLP Mod-V Q - A (Uploaded by Snaptricks - In)
7 pages
Updated IR
No ratings yet
Updated IR
38 pages
IRDM Assignment-I PDF
No ratings yet
IRDM Assignment-I PDF
4 pages
PART-I: Multiple Choices: Jimma University
100% (1)
PART-I: Multiple Choices: Jimma University
6 pages
IRS6
No ratings yet
IRS6
2 pages
IRS7
No ratings yet
IRS7
2 pages
2019 Spring Final Sol
No ratings yet
2019 Spring Final Sol
19 pages
Mid Semster Exam QP
100% (2)
Mid Semster Exam QP
2 pages
R05411201 Informationretrievalsystems
No ratings yet
R05411201 Informationretrievalsystems
4 pages
TYBSC-CS - SEM6 - IR - APR19 Munotes Mumbai University
No ratings yet
TYBSC-CS - SEM6 - IR - APR19 Munotes Mumbai University
2 pages
ACFrOgAhDKMNiLdAKJ27Hzg52gNTQw 5K PHitykqmtwIgd9UKTVkmihywbzrIyBvrHsHZZ9wixYTTAUoZYnERTr6vUQ Cfqlt65bXEVoMBh Ta3S1geQE-C8DUlimE
No ratings yet
ACFrOgAhDKMNiLdAKJ27Hzg52gNTQw 5K PHitykqmtwIgd9UKTVkmihywbzrIyBvrHsHZZ9wixYTTAUoZYnERTr6vUQ Cfqlt65bXEVoMBh Ta3S1geQE-C8DUlimE
2 pages
JanuaryFebruary-2023 Irs
No ratings yet
JanuaryFebruary-2023 Irs
2 pages
IR - Set 1
No ratings yet
IR - Set 1
5 pages
all unit 2 mark
No ratings yet
all unit 2 mark
15 pages
Database Systems The Complete Book 2nd Edition Molina Solutions Manual pdf download
100% (1)
Database Systems The Complete Book 2nd Edition Molina Solutions Manual pdf download
26 pages
IR Chapt 5
No ratings yet
IR Chapt 5
55 pages
B Tech WSM CSE 442 Endterm Online NOV 20-11-2021
No ratings yet
B Tech WSM CSE 442 Endterm Online NOV 20-11-2021
3 pages
Text Mining
No ratings yet
Text Mining
23 pages
University of Virginia Department of Computer Science CS 4501: Information Retrieval Fall 2015
No ratings yet
University of Virginia Department of Computer Science CS 4501: Information Retrieval Fall 2015
10 pages
4_IRModels
No ratings yet
4_IRModels
46 pages
CS8080 INFORMATION RETRIEVAL TECHNIQUES II INTERNAL EXAMINATION - Google Forms
No ratings yet
CS8080 INFORMATION RETRIEVAL TECHNIQUES II INTERNAL EXAMINATION - Google Forms
420 pages
CCS369 - TSS-Unit 3
No ratings yet
CCS369 - TSS-Unit 3
55 pages
Introduction To Information Retrieval: Courtesy
No ratings yet
Introduction To Information Retrieval: Courtesy
61 pages
Data Science Using Python and R
From Everand
Data Science Using Python and R
Chantal D. Larose
No ratings yet
PolarChoice Plus Brochure
No ratings yet
PolarChoice Plus Brochure
2 pages
PHP Paper PDF
100% (1)
PHP Paper PDF
3 pages
Ejercicios Sistemas de Ecuaciones Lineales
No ratings yet
Ejercicios Sistemas de Ecuaciones Lineales
8 pages
Energy Efficient/ Green Cloud Computing: BY Navneet Singh Pursuing Ph.D. From SBBSU, Jalandhar
No ratings yet
Energy Efficient/ Green Cloud Computing: BY Navneet Singh Pursuing Ph.D. From SBBSU, Jalandhar
62 pages
Arduino Midi Piano Pull Up
No ratings yet
Arduino Midi Piano Pull Up
3 pages
BSCE - Prospectus
No ratings yet
BSCE - Prospectus
12 pages
Laminar Airflow
No ratings yet
Laminar Airflow
15 pages
Chapter 2 - Force Systems
No ratings yet
Chapter 2 - Force Systems
37 pages
Chemistry Post Basic Two
No ratings yet
Chemistry Post Basic Two
3 pages
Performance of Education Graduates in The Licensure Examination For Teachers (Let)
No ratings yet
Performance of Education Graduates in The Licensure Examination For Teachers (Let)
22 pages
4.3 Hernández-ospina et al. 2024
No ratings yet
4.3 Hernández-ospina et al. 2024
13 pages
Int F (Int N) (Static Int I 1 If (N 5) Return N N N+i I++ Return F (N) )
No ratings yet
Int F (Int N) (Static Int I 1 If (N 5) Return N N N+i I++ Return F (N) )
9 pages
Linux Question Bank 2013 - 14 CBGS Sem VI
No ratings yet
Linux Question Bank 2013 - 14 CBGS Sem VI
6 pages
A Multiport Bidirectional DC-DC Converter for Hybrid Renewable
No ratings yet
A Multiport Bidirectional DC-DC Converter for Hybrid Renewable
7 pages
Solar Refrigeration System: Introduction To Solar Refrigerator
100% (1)
Solar Refrigeration System: Introduction To Solar Refrigerator
4 pages
IP Source Routing
No ratings yet
IP Source Routing
3 pages
Multi Channel Gas Detector Receiver: GTC-200A Series
No ratings yet
Multi Channel Gas Detector Receiver: GTC-200A Series
2 pages
DPP_04_Function as a Special Type of Relation, Domain, Range_Mathematics_12th_JEE_Gulf (UAE)_Shrey Baxi Sir_Aman Khan_Waliba
No ratings yet
DPP_04_Function as a Special Type of Relation, Domain, Range_Mathematics_12th_JEE_Gulf (UAE)_Shrey Baxi Sir_Aman Khan_Waliba
1 page
Design and Static Analysis of Gearbox For A CNC
No ratings yet
Design and Static Analysis of Gearbox For A CNC
9 pages
AEDsys 1
No ratings yet
AEDsys 1
10 pages
Download Full (Ebook) An Introduction to Decision Theory by Martin Peterson ISBN 9781107151598, 9781316585061, 9781316606209, 1107151597, 1316585069, 1316606201 PDF All Chapters
100% (1)
Download Full (Ebook) An Introduction to Decision Theory by Martin Peterson ISBN 9781107151598, 9781316585061, 9781316606209, 1107151597, 1316585069, 1316606201 PDF All Chapters
65 pages
Structural Analysis Abaqus
No ratings yet
Structural Analysis Abaqus
33 pages
NASA SMRC Technologies
No ratings yet
NASA SMRC Technologies
170 pages
Assignment Ict L
No ratings yet
Assignment Ict L
40 pages
Pall Water - IMPRO - CCRO Skid System
No ratings yet
Pall Water - IMPRO - CCRO Skid System
2 pages
SOS Seating Chart
No ratings yet
SOS Seating Chart
2 pages
Hgtd7N60B3S, Hgt1S7N60B3S, Hgtp7N60B3: 14A, 600V, Ufs Series N-Channel Igbts Features
No ratings yet
Hgtd7N60B3S, Hgt1S7N60B3S, Hgtp7N60B3: 14A, 600V, Ufs Series N-Channel Igbts Features
7 pages
Range Guide 2017
No ratings yet
Range Guide 2017
81 pages
Word Order 1a Present Simple
No ratings yet
Word Order 1a Present Simple
34 pages
18N50
No ratings yet
18N50
8 pages

Uploaded by

Uploaded by

Faculty of Engineering & Technology ‫ كلية الهندسة والتكنولوجيا‬-IUST

Department: Computer Exam: Mid

Question #1 (10 Marks)

→ System A performs better as it has higher F1 score. 1m

You might also like