Mostrar el registro sencillo del ítem

dc.creatorArizaleta Arteaga, Mirenes_ES
dc.date.accessioned2011-04-15T11:05:30Z
dc.date.available2011-04-15T11:05:30Z
dc.date.issued2011
dc.identifier.other0000577411es_ES
dc.identifier.urihttps://hdl.handle.net/2454/3363
dc.description.abstractNowadays everybody uses or has used the web for different reasons. Everybody is aware that the web constitutes an architecture to access information and retrieves data in the form of interconnected documents which are distributed in millions of machines through the internet. The most commonly used protocol for the retrieval of such documents is the http (Hypertext Transfer Protocol). When a user “demands” to retrieve some document or some information in the web, the use of this protocol is enough to do so. In this way the user can move through websites retrieving each piece of information or document which are useful to them at any given time. The question raised here is what happens in the case that one demands to retrieve millions and billions of documents or to retrieve a large volume of information either for future processing or for a simple reading. With the constant increase of the volume of data in the web as well as the daily renewal of the contents of the various websites, it is understandable that it is impossible for such a vast volume of data to be collected by the user, and therefore it is imperative the need to create mechanisms to automate this data retrieval procedure. This is exactly the purpose of the present project: the design and implementation of a complete data collection system (web crawler), which is applicable in the field of real estate in Greece. More specifically, the system implemented concerns the data (advertisements) retrieval by the five most popular property sites. The ultimate goal of this implementation is the collection of the advertisements from the above mentioned websites, so as to extract conclusions and statistical data for the overall picture of the property market in Greece. In addition to the above system and with the purpose to meet its demands, a database to store the retrieved data by the crawling process of the advertisements was designed and implemented.en
dc.format.mimetypeapplication/pdfen
dc.format.mimetypeapplication/zipen
dc.language.isoengen
dc.subjectRecuperación de la informaciónes_ES
dc.subjectData collection systemses_ES
dc.subjectRastreadoreses_ES
dc.subjectInformation retrievalen
dc.subjectData collection systemsen
dc.subjectWeb crawlersen
dc.titleUpgrade system: "Crawling Process: The real estate case"en
dc.typeinfo:eu-repo/semantics/masterThesisen
dc.typeProyecto Fin de Carrera / Ikasketen Amaierako Proiektuaes
dc.contributor.affiliationEscuela Técnica Superior de Ingenieros Industriales y de Telecomunicaciónes_ES
dc.contributor.affiliationTelekomunikazio eta Industria Ingeniarien Goi Mailako Eskola Teknikoaeu
dc.contributor.affiliationAthens University of Economics and Business (Grecia)es_ES
dc.contributor.departmentIngeniería Matemática e Informáticaes_ES
dc.contributor.departmentMatematika eta Informatika Ingeniaritzaeu
dc.description.degreeIngeniería Técnica en Informática de Gestiónes_ES
dc.description.degreeKudeaketa Informatikako Ingeniaritza Teknikoaeu
dc.rights.accessRightsinfo:eu-repo/semantics/openAccessen
dc.rights.accessRightsAcceso abierto / Sarbide irekiaes
dc.contributor.advisorTFEPina Calafi, Alfredoes_ES
dc.contributor.advisorTFEVazirgiannis, Michalises_ES


Ficheros en el ítem

Thumbnail
Thumbnail

Este ítem aparece en la(s) siguiente(s) colección(ones)

Mostrar el registro sencillo del ítem


El Repositorio ha recibido la ayuda de la Fundación Española para la Ciencia y la Tecnología para la realización de actividades en el ámbito del fomento de la investigación científica de excelencia, en la Línea 2. Repositorios institucionales (convocatoria 2020-2021).
Logo MinisterioLogo Fecyt