About: Acces to web archives, querying, navigating and optimizing

Facets (new session)
Description
Metadata
Settings
- owl:sameAs
- Inference Rule:

About: Acces to web archives, querying, navigating and optimizing Goto Sponge NotDistinct Permalink

An Entity of Type : rdac:C10001, within Data Space : data.idref.fr associated with source document(s)

Attributes	Values
type	frbr:Work rdac:C10001
Thesis advisor	Doucet, Anne (19..-.... ; informaticienne)
Author	Pehlivan, Zeynep (1981-....)
dc:subject	Web sémantique Thèses et écrits académiques Architecture des réseaux d'ordinateurs Édition en libre accès Publications électroniques Archivage électronique Information électronique -- Conservation
preferred label	Acces to web archives, querying, navigating and optimizing
Language	http://lexvo.org/id/iso639-3/eng
Subject	http://www.idref.fr/084038853/id http://www.idref.fr/034878084/id http://www.idref.fr/076053547/id http://www.idref.fr/02758772X/id http://www.idref.fr/028869931/id http://www.idref.fr/027253139/id http://www.idref.fr/149504772/id
dc:title	Acces to web archives, querying, navigating and optimizing
Degree granting institution	Université Pierre et Marie Curie (Paris ; 1971-2017)
note	Le Web crée chaque jour une quantité importante de connaissances culturelles et intellectuelles.Ses informations sont de nature éphémère car elles sont constamment remplacées, parfois sans aucunenotification. C’est pour cette raison que l’archivage du web est devenue une nécessité culturelle afinde préserver la connaissance pour les prochaines générations. Son succès sera cependant mesuré parses modes d’accès, comme ceux fournis jusqu’ici par le web. Notre recherche situe dans le contexte del’accès aux archives web, et étudie les différents problèmes d’accès qui y sont liés. Ces problèmes sontgroupés en deux thèmes principaux : Méthodes d’accès et Optimisation des accès. Pour les méthodesd’accès, nous proposons la base d’un langage de requête ayant par objectif de de mieux satisfaire lesbesoins d’information des utilisateurs. Une nouvelle méthode de navigation est ensuite introduite, quiprend en compte la cohérence des pages. Pour l’optimisation de l’accès, nous proposons un algorithmede détection de changement pour comprendre et quantifier ce qui s’est passé (et a donc changé) entredeux versions d’une même page Web. Nous étudions aussi le comportement des différentes méthodesd’élagage d’index statiques avec des requêtes temporelles. En outre, nous proposons une nouvelle méthode d’élagage index statiques basée sur la diversification et nous montrons son application aux collections temporelles et un gain supstanciel de performance par rapport aux autres approaches. An important amount of the world’s cultural and intellectual knowledge is being created on the webeveryday. However, the web has en ephemeral nature e.g. new information replaces older informationconstantly without any notification, leaving a significant gap in our knowledge. That’s why archivingthe web has become a cultural necessity to preserve the knowledge for the next generations. However,the success of any web archive will be measured by the means of access it provides; as it is the casetoday on the real web. Our research is placed in the context of access to web archives and studiesdifferent research problems related to this issue. These research problems are grouped into two maintopics: Access Methods and Optimization of Access. For access methods, we first propose a conceptualmodel, as well as operators to manipulate them, as the basis of a query language for web archives tobetter satisfy user information needs. Next, a new navigation method for web archives that takes thecoherence of pages into account is introduced. In the context of access optimization, we propose achange detection algorithm to understand and to quantify what happened (and thus changed) betweentwo versions of a web page. Then, we study the behavior of different static index pruning methodswith temporal queries before proposing a new diversification-based static index pruning method andshowing its application to temporal collections and a substantial gain in performance.
dc:type	Text
http://iflastandar...bd/elements/P1001	http://iflastandards.info/ns/isbd/terms/contentform/T1009
rdaw:P10219	2013
has content type	http://rdaregistry.info/termList/RDAContentType/1020
is primary topic of	http://www.idref.fr/248164260
is rdam:P30135 of	http://www.sudoc.fr/178314420/id http://www.sudoc.fr/248051113/id

Faceted Search & Find service v1.13.91 as of Aug 16 2018

Alternative Linked Data Documents: ODE Content Formats:

RDF

ODATA

Microdata

About

OpenLink Virtuoso version 07.20.3229 as of May 14 2019, on Linux (x86_64-pc-linux-gnu), Single-Server Edition (70 GB total memory)
Data on this page belongs to its respective rights holders.
Virtuoso Faceted Browser Copyright © 2009-2024 OpenLink Software