Our aim is to harvest, preserve, manage and reuse web content.
State-of-the-art web and social media archiving

The ephemeral nature of web and social media sites, especially of blogs (comments, online discussions etc.) leave them at substantial risk of being lost. Memory Institutions (libraries, museums, archives) and organisations are researching for ways to ensure long-term preservation and reuse of web content.

In Webternity we have the answer: we have developed an exciting system to harvest, preserve, manage and reuse web content. The system is performing an intelligent harvesting operation which retrieves and parses hypertext as well as all other associated content (images, linked files, etc.) from websites. The parsing action is able to render the captured content into structured data, expressed in XML; it does this in accordance with the our data model.

The result of this action is carving semantic entities out of web content on an unprecedented micro-level. Author names, comments, subjects, tags, categories, dates, links, and many other elements are expressed within a hierarchical structure. This content is imported into the Webternity repository (based on CERN’s Invenio platform), a public-facing web archiving mechanism which provides facilities to preserve, view, interrogate and reuse the content to an unprecedented degree of detail.


Web archiving support and consultancy

We support our clients to create their own web and social media archiving centre by installing, customising and maintaining the Webternity platform.

Cloud based web and social media archiving

Our clients are able to create and manage their own web archiving repositories through our online platform.

On demand web content analytics

We support our clients to develop valuable insights from their repositories, on terms of their interest.

Take the next step on creating your own web archiving collections.

Get started


Our team

Stratos Arampatzis
Ilias Trochidis
Panagiotis Chatzikamaris
Product development
Anthony Minas Krasakis
Product development
Vangelis Banos
Research advisor
Athanasia Anoyrkati
Web development
Olympia Papadopoulou
Administrative executive

Get in touch