Combining bandits and lexical analysis for document retrieval in a juridical corpora

Conference Paper


Abstract


  • Helping users find pertinent documents in a large corpus through simple search-engine queries is a major concern in information retrieval. The work presented in this article combines standard natural language processing methods, which estimate the relevance of a document to a query, with an online preference learning method that infers this pertinence by analyzing the past behavior of other users making similar searches. The first contribution is a specific heuristic method, conceived for an open-access online juridical corpus, to filter and interpret data collected from user behavior while navigating the search engine's query interface, the list of results, and the documents themselves. The second contribution is an original way of combining multi-armed bandit algorithms, which learn pertinence from implicit user feedback, with natural language processing techniques in order to define a single ranking for the search results.
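
The abstract does not give the authors' formulas, so the following is only a minimal sketch of the general idea of blending a lexical relevance score with a bandit estimate learned from implicit feedback. Everything in it is an assumption made for illustration: the bag-of-words cosine similarity stands in for whatever NLP scoring the paper uses, UCB1 stands in for whichever bandit algorithm the paper uses, and the names `lexical_score`, `UCB1Arm`, `rank`, and the mixing weight `alpha` are hypothetical.

```python
import math
from collections import Counter


def lexical_score(query, text):
    """Cosine similarity between raw term-count vectors.

    Assumption: a simple stand-in for any lexical/NLP relevance score.
    """
    q = Counter(query.lower().split())
    d = Counter(text.lower().split())
    dot = sum(q[t] * d[t] for t in q)
    norm = math.sqrt(sum(v * v for v in q.values())) * math.sqrt(
        sum(v * v for v in d.values())
    )
    return dot / norm if norm else 0.0


class UCB1Arm:
    """One arm per document; rewards in [0, 1] come from implicit feedback."""

    def __init__(self):
        self.pulls = 0
        self.reward = 0.0

    def update(self, r):
        self.pulls += 1
        self.reward += r

    def index(self, total_pulls):
        if self.pulls == 0:
            return 1.0  # assumption: optimistic value for documents never shown
        mean = self.reward / self.pulls
        bonus = math.sqrt(2.0 * math.log(total_pulls) / self.pulls)
        return min(1.0, mean + bonus)  # assumption: clip so both scores share [0, 1]


def rank(query, docs, arms, alpha=0.5):
    """Order document ids by a weighted sum of lexical and bandit scores.

    alpha is a hypothetical mixing weight: 1.0 means purely lexical,
    0.0 means purely learned from user feedback.
    """
    total = sum(arm.pulls for arm in arms.values())
    scores = {
        doc_id: alpha * lexical_score(query, text)
        + (1.0 - alpha) * arms[doc_id].index(total)
        for doc_id, text in docs.items()
    }
    return sorted(scores, key=scores.get, reverse=True)
```

In this sketch, a caller would keep one `UCB1Arm` per document (or per query-document pair), call `update` with a reward derived from filtered user behavior such as a click or a long read, and call `rank` to produce the single result list. A weighted sum is only one possible way of merging the two signals.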

Publication Date


  • 2020

Citation


  • Perotto, F. S., Verstaevel, N., Trabelsi, I., & Vercouter, L. (2020). Combining bandits and lexical analysis for document retrieval in a juridical corpora. In Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) Vol. 12498 LNAI (pp. 317-330). doi:10.1007/978-3-030-63799-6_24

Scopus EID


  • 2-s2.0-85098254218

Start Page


  • 317

End Page


  • 330

Volume


  • 12498 LNAI
