Fifteenth harvesting of the national domain

16. 12. 2025.

In order to complete the scope of archived web publications, the National and University Library in Zagreb (NSK), in cooperation with SRCE, will for the fifteenth time harvest content published under the national .hr domain, in accordance with the Ordinance on Legal Deposit (Official Gazette 66/2020).

The cooperation between NSK and SRCE on the development of the Croatian Web Archive has been ongoing since 2004, when the collection of legal deposit copies of online publications was launched pursuant to the Libraries and Library Activities Act (Articles 37–41).

Harvesting of the entire domain involves collecting all publicly available content within a defined period from active .hr, .com.hr and .from.hr domains.

For the harvesting process, a list of 138,796 active domains provided by NSK to the .hr Domain Registry at CARNET will be used.

This year, the harvesting robot is expected to collect around 17 TB of web-published content (web pages, images, documents, video content, etc.), which is approximately the amount collected during the 2024 harvesting. All collected content will be publicly available through the Croatian Web Archive website, where all previous harvests are also accessible.

The harvesting robot operates from the IP address 161.53.3.11 and identifies itself as Mozilla/5.0 (compatible; heritrix/3.12.0; + https://haw.nsk.hr/cesta-pitanja).

If the harvesting process affects your websites, please contact @email and @email.

In order to complete the scope of archived web publications, the National and University Library in Zagreb (NSK), in cooperation with SRCE, will for the fifteenth time harvest content published under the national .hr domain, in accordance with the Ordinance on Legal Deposit (Official Gazette 66/2020).

The cooperation between NSK and SRCE on the development of the Croatian Web Archive has been ongoing since 2004, when the collection of legal deposit copies of online publications was launched pursuant to the Libraries and Library Activities Act (Articles 37–41).

Harvesting of the entire domain involves collecting all publicly available content within a defined period from active .hr, .com.hr and .from.hr domains.

For the harvesting process, a list of 138,796 active domains provided by NSK to the .hr Domain Registry at CARNET will be used.

This year, the harvesting robot is expected to collect around 17 TB of web-published content (web pages, images, documents, video content, etc.), which is approximately the amount collected during the 2024 harvesting. All collected content will be publicly available through the Croatian Web Archive website, where all previous harvests are also accessible.

The harvesting robot operates from the IP address 161.53.3.11 and identifies itself as Mozilla/5.0 (compatible; heritrix/3.12.0; + https://haw.nsk.hr/cesta-pitanja).

If the harvesting process affects your websites, please contact @email and @email.