Mostrar el registro sencillo del ítem
Upgrade system: "Crawling Process: The real estate case"
dc.creator | Arizaleta Arteaga, Miren | es_ES |
dc.date.accessioned | 2011-04-15T11:05:30Z | |
dc.date.available | 2011-04-15T11:05:30Z | |
dc.date.issued | 2011 | |
dc.identifier.other | 0000577411 | es_ES |
dc.identifier.uri | https://hdl.handle.net/2454/3363 | |
dc.description.abstract | Nowadays everybody uses or has used the web for different reasons. Everybody is aware that the web constitutes an architecture to access information and retrieves data in the form of interconnected documents which are distributed in millions of machines through the internet. The most commonly used protocol for the retrieval of such documents is the http (Hypertext Transfer Protocol). When a user “demands” to retrieve some document or some information in the web, the use of this protocol is enough to do so. In this way the user can move through websites retrieving each piece of information or document which are useful to them at any given time. The question raised here is what happens in the case that one demands to retrieve millions and billions of documents or to retrieve a large volume of information either for future processing or for a simple reading. With the constant increase of the volume of data in the web as well as the daily renewal of the contents of the various websites, it is understandable that it is impossible for such a vast volume of data to be collected by the user, and therefore it is imperative the need to create mechanisms to automate this data retrieval procedure. This is exactly the purpose of the present project: the design and implementation of a complete data collection system (web crawler), which is applicable in the field of real estate in Greece. More specifically, the system implemented concerns the data (advertisements) retrieval by the five most popular property sites. The ultimate goal of this implementation is the collection of the advertisements from the above mentioned websites, so as to extract conclusions and statistical data for the overall picture of the property market in Greece. In addition to the above system and with the purpose to meet its demands, a database to store the retrieved data by the crawling process of the advertisements was designed and implemented. | en |
dc.format.mimetype | application/pdf | en |
dc.format.mimetype | application/zip | en |
dc.language.iso | eng | en |
dc.subject | Recuperación de la información | es_ES |
dc.subject | Data collection systems | es_ES |
dc.subject | Rastreadores | es_ES |
dc.subject | Information retrieval | en |
dc.subject | Data collection systems | en |
dc.subject | Web crawlers | en |
dc.title | Upgrade system: "Crawling Process: The real estate case" | en |
dc.type | info:eu-repo/semantics/masterThesis | en |
dc.type | Proyecto Fin de Carrera / Ikasketen Amaierako Proiektua | es |
dc.contributor.affiliation | Escuela Técnica Superior de Ingenieros Industriales y de Telecomunicación | es_ES |
dc.contributor.affiliation | Telekomunikazio eta Industria Ingeniarien Goi Mailako Eskola Teknikoa | eu |
dc.contributor.affiliation | Athens University of Economics and Business (Grecia) | es_ES |
dc.contributor.department | Ingeniería Matemática e Informática | es_ES |
dc.contributor.department | Matematika eta Informatika Ingeniaritza | eu |
dc.description.degree | Ingeniería Técnica en Informática de Gestión | es_ES |
dc.description.degree | Kudeaketa Informatikako Ingeniaritza Teknikoa | eu |
dc.rights.accessRights | info:eu-repo/semantics/openAccess | en |
dc.rights.accessRights | Acceso abierto / Sarbide irekia | es |
dc.contributor.advisorTFE | Pina Calafi, Alfredo | es_ES |
dc.contributor.advisorTFE | Vazirgiannis, Michalis | es_ES |