Research on Multi-Modal Large Language Models and their application for the verification and validation of identity documents

Otazu Redín, Judit

Research on Multi-Modal Large Language Models and their application for the verification and validation of identity documents

Files

TFM_FINAL.pdf (38.53 MB)

Consultable a partir de

2029-10-01

Date

2024

Authors

Otazu Redín, Judit

Director

Forcén Carvalho, Juan Ignacio

Acceso embargado / Sarbidea bahitua dago

Trabajo Fin de Máster / Master Amaierako Lana

Abstract

In response to the exponential growth and increasing adoption of Multi-Modal Large Language Models, this project aims to explore their application in a critical field: the verification and validation of identity documents. These models, which effectively integrate image, text, video, and audio processing, are proposed as potential improvements over traditional systems specialized in specific tasks. The research will compare the effectiveness of MM-LLMs against dedicated models, including both commercial and open-source solutions, in key tasks such as classification, image quality, fraud detection, OCR (Optical Character Recognition) and Entity Mapping. Additionally, the explainability of these multimodal models will be analyzed, offering a transparent alternative to the opacity of the ’black box’ typically associated with artificial intelligence. The study also recognizes and addresses the challenges that arise from the substantial hardware demands and potential latency issues inherent in these advanced systems.

Keywords

Large Language Model, Multi-Modal Large Language Model, Natural Language Processing, Computer Vision, Transformers, Embeddings, CNN, OCR (Optical Character Recognition), Document Authenticity, Anti-spoofing, Prompting

Faculty/School

Escuela Técnica Superior de Ingeniería Industrial, Informática y de Telecomunicación / Industria, Informatika eta Telekomunikazio Ingeniaritzako Goi Mailako Eskola Teknikoa

Degree

Máster Universitario en Ingeniería Informática por la Universidad Pública de Navarra, Nafarroako Unibertsitate Publikoko Unibertsitate Masterra Informatika Ingeniaritzan

URI

https://academica-e.unavarra.es/handle/2454/52031

Collections

Trabajos Fin de Máster ETSIIT - TIIGMET Master Amaierako Lanak
Trabajos Fin de Máster - Master Amaierako Lanak

Full item page

Research on Multi-Modal Large Language Models and their application for the verification and validation of identity documents

Files

Consultable a partir de

Date

Authors

Director

Publisher

Project identifier

Abstract

Description

Keywords

Department

Faculty/School

Degree

Doctorate program

URI

item.page.cita

item.page.rights

Collections