Translations:Archiveren van de website en sociale media van Studio ORKA/4/en
Website
Since the website contained extensive descriptions of the performances, the archivist decided to start with this material. Initially, an attempt was made to automate the process using a web crawler application to scan and store the entire website. This was done with Heritrix, a versatile web crawler often used for such tasks. For this specific application, where it was crucial that every link was correctly captured, this option proved problematic: some links were saved, while others were missing or not working correctly. This made the results unreliable and incomplete. They therefore moved away from Heritrix and opted for Archive WebPage, manually going through all the links on the Studio ORKA website to save the entire site in both WARC and WACZ formats (Web ARChive).