
Browsing by Author "Vazirgiannis, Michalis"

  • Publication, Open Access
    Upgrade system: "Crawling Process: The real estate case"
    (2011) Arizaleta Arteaga, Miren; Pina Calafi, Alfredo; Vazirgiannis, Michalis; Escuela Técnica Superior de Ingenieros Industriales y de Telecomunicación; Telekomunikazio eta Industria Ingeniarien Goi Mailako Eskola Teknikoa; Athens University of Economics and Business (Greece); Ingeniería Matemática e Informática; Matematika eta Informatika Ingeniaritza
    Nowadays everybody uses or has used the web for different reasons. The web is an architecture for accessing information: it retrieves data in the form of interconnected documents distributed across millions of machines on the internet. The protocol most commonly used to retrieve such documents is HTTP (Hypertext Transfer Protocol). When a user requests a document or a piece of information on the web, this protocol is enough to do so, and the user can move from website to website retrieving whatever information or documents are useful at a given time. The question raised here is what happens when one needs to retrieve millions or billions of documents, or a large volume of information, either for later processing or simply for reading. Given the constant growth of the volume of data on the web and the daily renewal of website content, a user cannot collect such a vast volume of data by hand, so mechanisms that automate this data retrieval procedure are needed. This is exactly the purpose of the present project: the design and implementation of a complete data collection system (a web crawler) applied to the field of real estate in Greece. More specifically, the implemented system retrieves data (advertisements) from the five most popular property sites. The ultimate goal is to collect the advertisements from these websites in order to draw conclusions and derive statistics about the overall state of the property market in Greece. To meet the demands of this system, a database was also designed and implemented to store the advertisements retrieved by the crawling process.
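
The abstract describes the general shape of a crawler: fetch a page, store it, follow its links, repeat. The record does not show the project's actual implementation, so the following is only a minimal sketch of that general idea, assuming a breadth-first traversal and an SQLite table. The names `crawl`, `LinkParser`, `FAKE_SITE`, and the `pages` table are hypothetical, and the in-memory "site" stands in for real HTTP requests.

```python
import sqlite3
from collections import deque
from html.parser import HTMLParser
from urllib.parse import urljoin


class LinkParser(HTMLParser):
    """Collects the href value of every <a> tag on a page."""

    def __init__(self):
        super().__init__()
        self.links = []

    def handle_starttag(self, tag, attrs):
        if tag == "a":
            for name, value in attrs:
                if name == "href" and value:
                    self.links.append(value)


def crawl(seed_url, fetch, conn, max_pages=100):
    """Breadth-first crawl from seed_url, storing each fetched page in SQLite.

    `fetch` is any callable that maps a URL to HTML text, or None on failure.
    Returns the number of pages stored.
    """
    conn.execute(
        "CREATE TABLE IF NOT EXISTS pages (url TEXT PRIMARY KEY, body TEXT)"
    )
    queue = deque([seed_url])
    seen = {seed_url}  # URLs already queued, so no page is fetched twice
    stored = 0
    while queue and stored < max_pages:
        url = queue.popleft()
        body = fetch(url)
        if body is None:
            continue  # skip pages that could not be retrieved
        conn.execute(
            "INSERT OR REPLACE INTO pages (url, body) VALUES (?, ?)",
            (url, body),
        )
        stored += 1
        parser = LinkParser()
        parser.feed(body)
        for href in parser.links:
            absolute = urljoin(url, href)  # resolve relative links
            if absolute not in seen:
                seen.add(absolute)
                queue.append(absolute)
    conn.commit()
    return stored


# Hypothetical three-page site used instead of real network access.
FAKE_SITE = {
    "http://example.test/": '<a href="/ad/1">Ad 1</a><a href="/ad/2">Ad 2</a>',
    "http://example.test/ad/1": '<p>Flat for sale</p><a href="/">Home</a>',
    "http://example.test/ad/2": "<p>House for rent</p>",
}

if __name__ == "__main__":
    connection = sqlite3.connect(":memory:")
    count = crawl("http://example.test/", FAKE_SITE.get, connection)
    print("pages stored:", count)
```

For a real site, `fetch` could wrap an HTTP client call. A production crawler would also need to restrict itself to the target hosts, respect robots.txt, and rate-limit its requests, none of which this sketch attempts.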
In collaboration with the Ministerio de Ciencia e Innovación and the Fundación Española para la Ciencia y la Tecnología (FECYT).

© Universidad Pública de Navarra - Nafarroako Unibertsitate Publikoa

  • Contact: academica-e@unavarra.es, +34 948 16 89 73, +34 948 16 89 74
  • Powered by DSpace