Please use this identifier to cite or link to this item:
https://www.um.edu.mt/library/oar/handle/123456789/141983| Title: | Table selection using information retrieval techniques for table-agnostic Text-to-SQL |
| Authors: | Bartolo, Timothy J. (2025) |
| Keywords: | SQL (Computer program language) Natural language processing (Computer science) Natural language generation (Computer science) Multisensor data fusion Database management -- Automation Data sets |
| Issue Date: | 2025 |
| Citation: | Bartolo, T. J. (2025). Table selection using information retrieval techniques for table-agnostic Text-to-SQL (Master’s dissertation). |
| Abstract: | Text-to-SQL has been effectively addressed using various NLP approaches, enabling the translation of natural language queries into SQL queries. A common prerequisite for these implementations, however, is the availability of the database table during inference. This requirement can pose challenges in scenarios where the table is not readily accessible to users. This work is motivated by the ongoing development of a chatbot tool within a private company, aimed at streamlining database interactions for the users. To address the table accessibility limitation, this study leverages Information Retrieval techniques to implement table selection based solely on the natural language query. We finetune pre-trained models like BERT and GIST-NoInstruct using the ColBERT method. We train our models using data we curate in-house by employing established LLM-prompting techniques. We prepare individual training datasets using two negative sampling techniques: uniform distribution and weighted probability distribution. We also experiment with various data fusion techniques such as RRF, CombMNZ, and Linear Combination to combine results from multiple search strategies. Our approach outperforms baseline methods in table retrieval, while also providing a comparative analysis of various retrieval strategies. |
| Description: | M.Sc.(Melit.) |
| URI: | https://www.um.edu.mt/library/oar/handle/123456789/141983 |
| Appears in Collections: | Dissertations - FacICT - 2025 Dissertations - FacICTAI - 2025 |
Files in This Item:
| File | Description | Size | Format | |
|---|---|---|---|---|
| 2519ICTICS520005082564_1.PDF | 1.69 MB | Adobe PDF | View/Open |
Items in OAR@UM are protected by copyright, with all rights reserved, unless otherwise indicated.
