Archiving of Dynamic Web Content

Thesis Type: Master
Thesis Status: Finished
Student: Stefan Pröll

Digital documents are becoming increasingly important and already exceed conventional documents in variety. Whereas long-term preservation procedures have been applied successfully to conventional documents for some years, preservation procedures for electronic documents are still at a rather early stage of development.
The aim of this thesis is to analyse current technologies that make it possible to store Web sites and digital documents permanently in a database. The focus is on algorithms that use incremental storage. This method (delta copy) records only the changes to the content and thereby avoids duplicates. A further goal of these procedures is to make the evolution of Web sites traceable over time.
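
The idea of incremental storage can be illustrated with a minimal sketch. It is not the algorithm developed in the thesis; the class `DeltaArchive` and its methods `add_snapshot` and `state_at` are hypothetical names chosen for illustration, and an in-memory dictionary stands in for the database. The sketch assumes that snapshots are added in chronological order and that a page is identified by its URL.

```python
import hashlib


class DeltaArchive:
    """Toy incremental archive: each snapshot stores only what changed."""

    def __init__(self):
        self.blobs = {}      # content hash -> page content (stored once)
        self.snapshots = []  # list of (timestamp, {url: hash, or None if removed})
        self._current = {}   # url -> content hash of the latest known state

    def _store(self, content):
        # Identical content always maps to the same hash, so it is kept only once.
        digest = hashlib.sha256(content.encode("utf-8")).hexdigest()
        self.blobs.setdefault(digest, content)
        return digest

    def add_snapshot(self, timestamp, pages):
        """Record a crawl ({url: content}) and return the stored delta."""
        delta = {}
        for url, content in pages.items():
            digest = self._store(content)
            if self._current.get(url) != digest:
                delta[url] = digest          # new or changed page
        for url in self._current:
            if url not in pages:
                delta[url] = None            # page disappeared from the site
        for url, digest in delta.items():
            if digest is None:
                del self._current[url]
            else:
                self._current[url] = digest
        self.snapshots.append((timestamp, delta))
        return delta

    def state_at(self, timestamp):
        """Rebuild the site as it looked at the given time by replaying deltas."""
        state = {}
        for ts, delta in self.snapshots:
            if ts > timestamp:
                break
            for url, digest in delta.items():
                if digest is None:
                    state.pop(url, None)
                else:
                    state[url] = self.blobs[digest]
        return state


archive = DeltaArchive()
archive.add_snapshot(1, {"/": "home v1", "/about": "about"})
second = archive.add_snapshot(2, {"/": "home v2", "/about": "about"})
third = archive.add_snapshot(3, {"/": "home v2"})
```

In this usage example the second snapshot records only the changed start page, the third records only the removal of `/about`, and `state_at` replays the deltas to retrace how the site looked at an earlier point in time.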