Upgrade system: "Crawling Process: The real estate case"

Arizaleta Arteaga, Miren

dc.creator	Arizaleta Arteaga, Miren	es_ES
dc.date.accessioned	2011-04-15T11:05:30Z
dc.date.available	2011-04-15T11:05:30Z
dc.date.issued	2011
dc.identifier.other	0000577411	es_ES
dc.identifier.uri	https://hdl.handle.net/2454/3363
dc.description.abstract	Nearly everyone uses or has used the web. The web is an architecture for accessing information: it retrieves data in the form of interconnected documents distributed across millions of machines on the internet. The protocol most commonly used to retrieve such documents is HTTP (Hypertext Transfer Protocol). When a user requests a document or a piece of information on the web, this protocol is enough to retrieve it, so the user can move through websites retrieving whichever documents are useful to them at any given time. The question raised here is what happens when one needs to retrieve millions or billions of documents, or a large volume of information, either for later processing or simply for reading. Given the constant growth of the volume of data on the web and the daily renewal of website content, a user cannot collect such a vast volume of data by hand, so mechanisms that automate data retrieval are needed. This is exactly the purpose of the present project: the design and implementation of a complete data collection system (web crawler) applied to the field of real estate in Greece. More specifically, the implemented system retrieves data (advertisements) from the five most popular property sites. The ultimate goal is to collect the advertisements from these websites in order to draw conclusions and derive statistics about the overall state of the property market in Greece. To support this system, a database was also designed and implemented to store the advertisements retrieved by the crawling process.	en
dc.format.mimetype	application/pdf	en
dc.format.mimetype	application/zip	en
dc.language.iso	eng	en
dc.subject	Recuperación de la información	es_ES
dc.subject	Data collection systems	es_ES
dc.subject	Rastreadores	es_ES
dc.subject	Information retrieval	en
dc.subject	Data collection systems	en
dc.subject	Web crawlers	en
dc.title	Upgrade system: "Crawling Process: The real estate case"	en
dc.type	info:eu-repo/semantics/masterThesis	en
dc.type	Proyecto Fin de Carrera / Ikasketen Amaierako Proiektua	es
dc.contributor.affiliation	Escuela Técnica Superior de Ingenieros Industriales y de Telecomunicación	es_ES
dc.contributor.affiliation	Telekomunikazio eta Industria Ingeniarien Goi Mailako Eskola Teknikoa	eu
dc.contributor.affiliation	Athens University of Economics and Business (Grecia)	es_ES
dc.contributor.department	Ingeniería Matemática e Informática	es_ES
dc.contributor.department	Matematika eta Informatika Ingeniaritza	eu
dc.description.degree	Ingeniería Técnica en Informática de Gestión	es_ES
dc.description.degree	Kudeaketa Informatikako Ingeniaritza Teknikoa	eu
dc.rights.accessRights	info:eu-repo/semantics/openAccess	en
dc.rights.accessRights	Acceso abierto / Sarbide irekia	es
dc.contributor.advisorTFE	Pina Calafi, Alfredo	es_ES
dc.contributor.advisorTFE	Vazirgiannis, Michalis	es_ES

Files in this item

Name: 577411.pdf
Size: 808.5 KB
Format: PDF

Name: 577411 anejo.zip
Size: 4.257 MB
Format: Unknown
