When Logical Inference Helps Determining Textual Entailment (and When it Doesn´t)
When Logical Inference Helps Determining Textual Entailment (and When it Doesn´t)
0.25
0.5
0.75
1.25
1.5
1.75
2
We compare and combine two methods to approach the second textual entailment challenge (RTE-2): a shallow method based mainly on word-overlap and a method based on logical inference, using first-order theorem proving and model building techniques. We use a machine learning technique to combine features of both methods. We submitted two runs, one using only the shallow features, yielding an accuracy of 61.6%, and one using features of both methods, performing with an accuracy score of 60.6%. These figures suggest that logical inference didn´t help much. Closer inspection of the results revealed that only for some of the subtasks logical inference played a significant role in performance. We try to explain the reason for these results.