+ All Categories
Home > Documents > Geoportal Helpdesk - Support #2509 reason for this enquiry is that we are in the process of...

Geoportal Helpdesk - Support #2509 reason for this enquiry is that we are in the process of...

Date post: 12-Jun-2018
Category:
Upload: vuminh
View: 213 times
Download: 0 times
Share this document with a friend
26
Geoportal Helpdesk - Support #2509 AT - LFRZ: Observed reduction in the number of metadata records 06 Oct 2015 11:10 am - Angelo Quaglia Status: Resolved Start date: 06 Oct 2015 Priority: Normal Due date: Assignee: Angelo Quaglia % Done: 0% Category: Harvesting results Estimated time: 0.00 hour Target version: Submitting Organisation: AT - LFRZ Country: AT - Austria Proactive: Yes Originating UI: Description From: Angelo Quaglia [mailto:[email protected]] Sent: 06 October 2015 11:09 To: Seeger, Oliver; Melzer, Georg Cc: 'Infrastruktur und Betrieb' Subject: Observed reduction in the number of metadata records Importance: High  Dear Sirs, We have observed a decrease in the number of metadata resources returned by the National Discovery Service of Austria.  They went from 846 down to 828.  Is this expected?  The reason for this enquiry is that we are in the process of reconciling what is declared in the annual Monitoring & Reporting documentation with what is actually by returned by Member States Discovery Services and we would like to be sure that what we receive is actually what intended.  Latest report: http://inspire-geoportal.ec.europa.eu/resources/errors/INSPIRE-1b978242-062b-11e1-b7de-52540004b857_20151006-055211/servi es/1/PullResults/ /system/rich/rich_files/rich_files/000/000/341/original/mail248.png  http://inspire-geoportal.ec.europa.eu/resources/INSPIRE-1b978242-062b-11e1-b7de-52540004b857_20151004-190027/services/1/P llResults/ 09 Jul 2018 1/25
Transcript

Geoportal Helpdesk - Support #2509AT - LFRZ: Observed reduction in the number of metadata records06 Oct 2015 11:10 am - Angelo Quaglia

Status: Resolved Start date: 06 Oct 2015Priority: Normal Due date:Assignee: Angelo Quaglia % Done: 0%Category: Harvesting results Estimated time: 0.00 hourTarget version:Submitting Organisation:AT - LFRZ Country: AT - AustriaProactive: Yes Originating UI:Description

From: Angelo Quaglia [mailto:[email protected]]Sent: 06 October 2015 11:09To: Seeger, Oliver; Melzer, GeorgCc: 'Infrastruktur und Betrieb'Subject: Observed reduction in the number of metadata recordsImportance: High

 

Dear Sirs,

We have observed a decrease in the number of metadata resources returned by the National Discovery Service of Austria.

 

They went from 846 down to 828.

 

Is this expected?

 

The reason for this enquiry is that we are in the process of reconciling what is declared in the annual Monitoring & Reportingdocumentation with what is actually by returned by Member States Discovery Services and we would like to be sure that what wereceive is actually what intended.

 

Latest report:

http://inspire-geoportal.ec.europa.eu/resources/errors/INSPIRE-1b978242-062b-11e1-b7de-52540004b857_20151006-055211/servies/1/PullResults/

/system/rich/rich_files/rich_files/000/000/341/original/mail248.png

 

http://inspire-geoportal.ec.europa.eu/resources/INSPIRE-1b978242-062b-11e1-b7de-52540004b857_20151004-190027/services/1/PllResults/

09 Jul 2018 1/25

 

/system/rich/rich_files/rich_files/000/000/340/original/mail249.png

Best regards,

Angelo Quaglia

 

 

 

Ing. Angelo Quaglia

External Consultant

European Commission, DG Joint Research CentreInstitute for Environment and Sustainability

Digital Earth and Reference Data Unit, T.P. 262

Via E. Fermi, 2749.I-21027 Ispra (VA)Italy

Tel: +39 347 78 88 492Fax: +39 0332 78 6325e-mail: mailto:[email protected]

URL: http://ies.jrc.ec.europa.eu/SDI/sdi-about-us/staff-profiles/angelo-quaglia.html

 

The views expressed are purely those of the writer and may not in any circumstances be regarded as stating an official position of theEuropean Commission.

Related issues:Copied to Geoportal Helpdesk - Support # 2513: AT - LFRZ: Access to the Disco... Closed 06 Oct 2015

History#1 - 06 Oct 2015 04:51 pm - Angelo Quaglia

From: Illmeyer, Manuel [mailto:[email protected]]Sent: 06 October 2015 15:58To:[email protected]: AW: Observed reduction in the number of metadata records

 

Klassifizierung - INTERN: Die Inhalte und Informationen dieser Nachricht sind ausschließlich für die kommunizierenden Unternehmen bestimmt.

09 Jul 2018 2/25

 

Dear Angelo,

 

we have analysed the reports from 2015-10-06 and 2015-10-04 and found out, that there are 18 datasets which are twice in the report from2015-10-04. There only exist 828 distinct IDs in both reports.

 

The duplicate metadata have following IDs (details in the csv attached):

- 0cab4580-e202-4e2f-9edc-5699db618535 - 1036cbad-4f03-4ddd-b825-06ccc8445737 - 285dbc62-82b8-4370-88d2-b34ee1ee0ce4 - 2eded083-f087-425a-ae3e-5cb92d63b0bf - 4e1b0517-f2bd-45e3-a5f1-e9958c5b3618 - 761f8178-8563-4e34-9428-67b7c69e069c - 884bb2e0-96c6-4cb4-b654-c48da995f47e - 985b1670-1068-4ec1-abd2-31711ec5b21f - 9f700f35-c02f-42d3-99b8-28f23ee9bba5 - a1e7080c-2368-4b20-86d3-13c578a258a1 - aafda3f0-900e-4a28-8ca0-b2ea14535800 - b1dd7afd-4865-4760-a665-9b9e13bd57e8 - d0471fb6-565a-4715-aeeb-9a63c6d66e48 - d9d50562-cfea-42e1-b27b-9ba31d3463bd - dbd54804-ac0a-4dde-b035-61f475f6dce5 - e61c8ea4-8484-4fc9-9998-646a433ff092 - e993a684-ab3a-4427-8e6b-ed0eea48e1f1 - f31ad289-0dd6-4811-8b8c-8ccf80e17695

 

Do you know, why they are counted twice?

 

Best Regards,

Manuel Illmeyer

 

Manuel Illmeyer, MSc

Land-, forst- und wasserwirtschaftliches Rechenzentrum GmbH

A-1200 Wien, Dresdner Straße 89

Tel:   ++43 (1) 33 176/414

Fax:  ++43 (1) 33 176/7414

09 Jul 2018 3/25

E-Mail:  [email protected]

WWW:  http://www.lfrz.at

FN 262918 w    Handelsgericht Wien

 

Hinweis:

Diese Nachricht und eventuelle Dateianhänge sind vertraulich und nur für den (die) darin angegebenen Adressaten bestimmt. Jeder Gebrauch, jedeWeitergabe oder Vervielfältigung durch andere als den (die) bestimmten Adressaten ist verboten. Sollten Sie diese Sendung irrtümlich erhalten haben,verständigen Sie bitte unverzüglich per E-Mail den Absender und löschen Sie die Sendung samt allfälligen Dateianhängen aus Ihrem System.

 

Please note:

This message and any attachment thereof are confidential and may be privileged or otherwise protected from disclosure. If you are not the intendedrecipient any use, distribution, copying or disclosure is strictly prohibited. If you have received this message in error, please notify the senderimmediately by return e-mail and delete this message and any attachment thereof from your system.

 

#2 - 06 Oct 2015 04:52 pm - Angelo Quaglia

From: Angelo Quaglia [mailto:[email protected]]Sent: 06 October 2015 16:51To: 'Illmeyer, Manuel'Subject: RE: Observed reduction in the number of metadata records

 

Dear Manuel,

 

Many thanks for your fast and detailed response.

 

During the harvesting documented in the report from 2015-10-04 the fileIdentifier 0cab4580-e202-4e2f-9edc-5699db618535 was present in twodifferent metadata records:

 

1. http://inspire-geoportal.ec.europa.eu/resources/INSPIRE-1b978242-062b-11e1-b7de-52540004b857_20151004-190027/services/1/PullResults/811-80/2.iso19139.xml

Geoportal HTML: http://inspire-geoportal.ec.europa.eu/resources/INSPIRE-1b978242-062b-11e1-b7de-52540004b857_20151004-190027/services/1/PullResults/811-8

09 Jul 2018 4/25

0/datasets/2

 

1. http://inspire-geoportal.ec.europa.eu/resources/INSPIRE-1b978242-062b-11e1-b7de-52540004b857_20151004-190027/services/1/PullResults/811-80/3.iso19139.xml

Geoportal HTML: http://inspire-geoportal.ec.europa.eu/resources/INSPIRE-1b978242-062b-11e1-b7de-52540004b857_20151004-190027/services/1/PullResults/811-80/datasets/3

 

The original GetRecordsResponse received from the Discovery Service is here:

http://inspire-geoportal.ec.europa.eu/resources/INSPIRE-1b978242-062b-11e1-b7de-52540004b857_20151004-190027/services/1/PullResults/811-80/downloaded.xml

 

There are indeed two different MD_Metadata elements inside which the same fileIdentifier is used:

 

http://www.opengis.net/cat/csw/2.0.2"

                        xmlns:xsi="http://www.w3.org/2001/XMLSchema-instance"

                        xsi:schemaLocation="http://www.opengis.net/cat/csw/2.0.2 http://schemas.opengis.net/csw/2.0.2/CSW-discovery.xsd">

 

                        numberOfRecordsReturned="10"

                      elementSet="full"

                      nextRecord="821">

 

/system/rich/rich_files/rich_files/000/000/344/original/mail250.png

 

/system/rich/rich_files/rich_files/000/000/342/original/mail251.png

 

09 Jul 2018 5/25

 

I have not checked the other cases, yet, but I assume they are other occurrences of the same problem.

 

Here is a historical chart of the number of metadata retrieved from Austria:

 

/system/rich/rich_files/rich_files/000/000/343/original/mail252.png

 

 

I assume the latest harvesting is not affected by the same issues so please confirm I should promote it to production.

 

 

Best regards,

Angelo

 

#3 - 06 Oct 2015 05:11 pm - Angelo Quaglia

From: Angelo Quaglia [mailto:[email protected]]Sent: 06 October 2015 17:11To: 'Illmeyer, Manuel'Subject: RE: Observed reduction in the number of metadata records

 

Dear Manuel,

There were multiple datasets and multiple services metadata sharing the same fileIdentifier:

 

1036cbad-4f03-4ddd-b825-06ccc844573 – two datasets metadata found

./services/1/PullResults/811-820/downloaded.xml

./services/1/PullResults/811-820/8.iso19139.xml

./services/1/PullResults/811-820/9.iso19139.xml

./services/1/PullResults/811-820/datasets/8/inspire.geoportal.resource.dataset.xml

09 Jul 2018 6/25

./services/1/PullResults/811-820/datasets/9/inspire.geoportal.resource.dataset.xml

 

285dbc62-82b8-4370-88d2-b34ee1ee0ce4 – two service metadata

./services/1/PullResults/811-820/downloaded.xml

./services/1/PullResults/811-820/4.iso19139.xml

./services/1/PullResults/811-820/5.iso19139.xml

./services/1/PullResults/811-820/services/4/inspire.geoportal.resource.service.xml

./services/1/PullResults/811-820/services/5/inspire.geoportal.resource.service.xml

 

2eded083-f087-425a-ae3e-5cb92d63b0bf – two dataset metadata

./services/1/PullResults/801-810/downloaded.xml

./services/1/PullResults/801-810/10.iso19139.xml

./services/1/PullResults/801-810/datasets/10/inspire.geoportal.resource.dataset.xml

 

./services/1/PullResults/811-820/downloaded.xml

./services/1/PullResults/811-820/1.iso19139.xml

./services/1/PullResults/811-820/datasets/1/inspire.geoportal.resource.dataset.xml

./services/1/PullResults/811-820/datasets/1/inspire.geoportal.resource.dataset.solr.xml

 

 

Best regards,

Angelo

 

#4 - 07 Oct 2015 09:12 am - Angelo Quaglia

From: Illmeyer, Manuel [mailto:[email protected]]Sent: 07 October 2015 07:49To: Angelo QuagliaSubject: AW: Observed reduction in the number of metadata records

09 Jul 2018 7/25

 

Klassifizierung - INTERN: Die Inhalte und Informationen dieser Nachricht sind ausschließlich für die kommunizierenden Unternehmen bestimmt.

 

Dear Angelo,

 

thanks for your response. The thing is, we only have these MD_Metadata elements once in our national discovery service. The pull results withidentical file identifiers are the exact same metadata records. So I don’t know if this was a problem with our discovery service or a problem with theGetRecordsResponse.

 

But since the problem didn’t occur on the Pull from  2015-10-06, you can promote it to production.

 

Best Regards,

Manuel

#5 - 07 Oct 2015 09:12 am - Angelo Quaglia- Status changed from Assigned to Resolved

From: Angelo Quaglia [mailto:[email protected]]Sent: 07 October 2015 09:12To: 'Illmeyer, Manuel'Subject: RE: Observed reduction in the number of metadata records

 

Dear Manuel,

It’s done.

 

Best regards,

Angelo

 

 

 

09 Jul 2018 8/25

#6 - 13 Oct 2015 10:05 am - Angelo Quaglia- Status changed from Resolved to Feedback

From: Angelo Quaglia [mailto:[email protected]]Sent: 13 October 2015 10:04To: 'Illmeyer, Manuel'Subject: RE: Observed reduction in the number of metadata records

 

Dear Manuel,

 

The problem is occurring again.

 

The number of metadata records returned keeps changing, depending on when the harvesting is done:

 

Latest (manually promoted to production)

13 Oct 2015, 03:53:14     13 Oct 2015, 04:14:52

http://inspire-geoportal.ec.europa.eu/GeoportalProxyWebServices/resources/INSPIREResource/INSPIRE-1b978242-062b-11e1-b7de-52540004b85720151013-055309/services/1/PullResults/

Result of the interaction with the Discovery Service

Resources available for discovery: 828, Expected Resource Count: 828, Actual Resource Count : 828

NOTE: Additional 8 records have been harvested from Linked Discovery Services. See the Summary of Discovery Services

 

 

Previous:

11 Oct 2015, 18:11:45     11 Oct 2015, 18:29:10

http://inspire-geoportal.ec.europa.eu/GeoportalProxyWebServices/resources/INSPIREResource/INSPIRE-1b978242-062b-11e1-b7de-52540004b85720151011-201141/services/1/PullResults/

Result of the interaction with the Discovery Service

Resources available for discovery: 834, Expected Resource Count: 834, Actual Resource Count : 834

NOTE: Additional 8 records have been harvested from Linked Discovery Services. See the Summary of Discovery Services

09 Jul 2018 9/25

 

 

This time the problem does not affect the record with fileIdentifier “0cab4580-e202-4e2f-9edc-5699db618535”:

 

[tomcat8@s-jrciprap264p INSPIRE-1b978242-062b-11e1-b7de-52540004b857_20151010-183201]$ find . -maxdepth 5   -type f  -name"downloaded.xml"  -exec grep -il "0cab4580-e202-4e2f-9edc-5699db618535" {} \;

./services/1/PullResults/121-130/downloaded.xml

 

I have added the exact timeframe for each harvesting. I hope this helps your investigations

 

May I ask you to further investigate?

 

Best regards,

Angelo

 

 

 

Ing. Angelo Quaglia

External Consultant

European Commission, DG Joint Research CentreInstitute for Environment and Sustainability

Digital Earth and Reference Data Unit, T.P. 262

Via E. Fermi, 2749.I-21027 Ispra (VA)Italy

Tel: +39 347 78 88 492Fax: +39 0332 78 6325e-mail: mailto:[email protected]

URL: http://ies.jrc.ec.europa.eu/SDI/sdi-about-us/staff-profiles/angelo-quaglia.html

 

09 Jul 2018 10/25

The views expressed are purely those of the writer and may not in any circumstances be regarded as stating an official position of the EuropeanCommission.

 

#7 - 13 Oct 2015 06:32 pm - Angelo Quaglia

From: Illmeyer, Manuel [mailto:[email protected]]Sent: 13 October 2015 14:45To: Angelo QuagliaSubject: AW: Observed reduction in the number of metadata records

 

Klassifizierung - INTERN: Die Inhalte und Informationen dieser Nachricht sind ausschließlich für die kommunizierenden Unternehmen bestimmt.

 

Dear Angelo,

 

thanks for the information. There are some changed metadata elements but there are also duplicate metadata uuids in our discovery service. We willinvestigate now, why these elements are delivered twice.

 

Best regards,

Manuel

#8 - 14 Oct 2015 08:57 am - Angelo Quaglia

From: Illmeyer, Manuel [mailto:[email protected]]Sent: 14 October 2015 07:48To: Angelo QuagliaSubject: AW: Observed reduction in the number of metadata records

 

Klassifizierung - INTERN: Die Inhalte und Informationen dieser Nachricht sind ausschließlich für die kommunizierenden Unternehmen bestimmt.

 

Dear Angelo,

 

we have made some improvements on the discovery service. I hope this problem won’t occur again.

 

09 Jul 2018 11/25

Best Regards,

Manuel

#9 - 21 Mar 2016 02:39 pm - Angelo Quaglia

From: Angelo Quaglia [mailto:[email protected]]Sent: 21 March 2016 14:38To: Illmeyer, Manuel ([email protected]) Subject: floating number of harvested metadata records

 

Dear Manuel,

 

Now that you that your account has, finally, been created in our Collaboration Environment, I would appreciate if we could exchange using theCollaboration Environment.

 

So, if you confirm you can update issues, this will be my last direct email to you.

 

This issue is ongoing since some time now and is being tracked at https://ies-svn.jrc.ec.europa.eu/issues/2509

 

Here is a chart that shows the variation in the number of the metadata records harvested from the Austrian National Discovery Service, since 1st

January 2016.

 

Harvestings are usually performed during the night between 6pm and 8am.

 

Apart from the number of metadata records, the Austrian National Discovery Service responds efficiently and reliably.

 

Do you know the reasons behind these fluctuations?

 

If necessary, we have the recordings of each harvesting session, including the GetRecordsReponse as they were received by the INSPIRE Geoportal.

 

If you need any additional information, I am of course available.

09 Jul 2018 12/25

 

 

 

/system/rich/rich_files/rich_files/000/000/520/original/mail166.png

 

 

 

 

Best regards,

 

Angelo

 

#10 - 30 Mar 2016 05:31 pm - Angelo Quaglia

From: Illmeyer, Manuel [mailto:[email protected]]Sent: 24 March 2016 14:43To: Angelo Quaglia Subject: AW: floating number of harvested metadata records

 

Dear Angelo,

 

thanks i have received the invite. I will look into it soon.

 

Best regards,

Manuel

#11 - 30 Mar 2016 05:32 pm - Angelo Quaglia

Dear Manuel,

do you have any updates no this issue?

09 Jul 2018 13/25

Please update directly this issue using the tracker so that I do not have to copy and paste your emails here.

Best regards,

Angelo

#12 - 08 Apr 2016 10:50 am - Angelo Quaglia

From: Angelo Quaglia [mailto:[email protected]]Sent: 08 April 2016 10:49To: Illmeyer, Manuel ([email protected]) Cc: 'Jobst Markus' ; Melzer, Georg ; Seeger, Oliver Subject: (This problem was first reported to Austria on 06 Oct 2015): Floating number of harvested metadata recordsImportance: High

 

Dear Manuel,

 

I urge you to look into this issue that was reported to Austria on 5th October 2015 but has not been addressed yet.

 

Are you in a position to tell me what is the expected number of metadata records expected delivered by the Austrian National Discovery Service?

 

The number of metadata documents made available by the Austrian National Discovery Service keeps going up and down (see the diagram below):

03 Apr 2016, 23:25:01?Resources available for discovery: 911, Expected Resource Count: 911, Actual Resource Count : 911

 

08 Apr 2016, 03:47:44

Resources available for discovery: 901, Expected Resource Count: 901, Actual Resource Count : 901

 

This is the historical trend:

 

/system/rich/rich_files/rich_files/000/000/547/original/mail186.png

 

 

09 Jul 2018 14/25

Best regards,

Angelo

 

#13 - 08 Apr 2016 11:21 am - Manuel Illmeyer

Dear Angelo,

sorry for the late response.

901 is the current and correct amount of metadata records that are reported, respectively harvested, from the Austrian Discvovery Service.

Actually i do know, that there were some metadata, that have been depublished in the last few days and there have also been some changesregarding metadata in recent weeks.

To investigate this issue further we are, from now on, comparing the GetRecord responses from our discovery service, so that we can look into it, ifthere are massive changes again.

Above you mentioned, that you could give us the GetRecordsResponse of your Harvesting sessions. Could you send me some samples e.g. from:

- 06/02/2016 - 21/02/2016 - 06/03/2016 - 10/03/2016 - 21/03/2016

Is the hiosotrical chart accesible for us?

Best regards,

Manuel

#14 - 08 Apr 2016 01:01 pm - Angelo Quaglia- File 20160408.zip added

Thank you Manuel,

I have uploaded here the file 20160408.zip which contains an excel file with the charts and also the original xml file produced by the geoportal. Wehave built a dashboard application but this has not been published, yet. In any case, Excel gives all the needed charting power.

I gather that the total records for Austria is therefore 901 from the National Discovery Service + 8 from the linked discovery services = 911 totalrecords from Austria.

Please note that Austria has chosen the Discovery Client scenario for the registration of the National Discovery Service with the INSPIRE Geoportal.

This means that the INSPIRE Geoportal recursively harvests each Discovery Service (also known as "Linked Discovery Service") described in eachmetadata harvested from the National Discovery Service.

09 Jul 2018 15/25

Therefore, a change in the toal number of metadata documents can be due to the National Discovery Service or to one of the linked ones.

In the validation reports the two figures are kept separate: the counts inside the green box refer to the National Discovery Service, while those in redboxes refer to the Linked Discovery Services:

/system/rich/rich_files/rich_files/000/000/549/original/mail188.png

 

If you click on "Summary of Discovery Services" the page will scroll down to a table that gives the detail for each linked Discovery Service:

Linked Discovery Service Title

Pull Operation Report

Resources available for

discovery

Expected Resource Count

Resources actually

retrieved 

Resource Summary

Average Conformity of

MetadataCSW Suchdienst BEV

Report 971 971  971 http://inspire-geoportal.ec.europa.eu/chart?chds=a&cht=p3&chs=400x100&chd=t:759,210,2&chl=dataset%7Cservice%7Cunknown&chdl=dataset:759%7Cservice:210%7Cunknown:2&chdls=,14&chtt=Resource%20Types

93.38% 

STATISTIK AUSTRIA INSPIRE Suchdienst

Report 254 254  254 http://inspire-geoportal.ec.europa.eu/chart?chds=a&cht=p3&chs=400x100&chd=t:251,3&chl=dataset%7Cservice&chdl=dataset:251%7Cservice:3&chdls=,14&chtt=Resource%20Types

100.00% 

CSW Suchdienst der ZAMG

Report 11 11  11 http://inspire-geoportal.ec.europa.eu/chart?chds=a&cht=p3&chs=400x100&chd=t:8,3&chl=dataset%7Cservice&chdl=dataset:8%7Cservice:3&chdls=,14&chtt=Resource%20Types

100.00% 

AT.GEOLAND.CSW

Report 381 381  372 http://inspire-geoportal.ec.europa.eu/chart?chds=a&cht=p3&chs=400x100&chd=t:276,95,1&chl=dataset%7Cservice%7Cunknown&

09 Jul 2018 16/25

92.63% 

chdl=dataset:276%7Cservice:95%7Cunknown:1&chdls=,14&chtt=Resource%20Types

 

Here are the links to the harvesting reports for the most relevant dates when a spike was observed:

06/02/2016 http://inspire-geoportal.ec.europa.eu/resources/sandbox/INSPIRE-1b978242-062b-11e1-b7de-52540004b857_20160205-183906/services/1/PullResults/

12/02/2016 http://inspire-geoportal.ec.europa.eu/resources/sandbox/INSPIRE-1b978242-062b-11e1-b7de-52540004b857_20160211-043222/services/1/PullResults/

21/02/2016 http://inspire-geoportal.ec.europa.eu/resources/sandbox/INSPIRE-1b978242-062b-11e1-b7de-52540004b857_20160220-181059/services/1/PullResults/

06/03/2016 http://inspire-geoportal.ec.europa.eu/resources/sandbox/INSPIRE-1b978242-062b-11e1-b7de-52540004b857_20160305-185017/services/1/PullResults/

10/03/2016 http://inspire-geoportal.ec.europa.eu/resources/sandbox/INSPIRE-1b978242-062b-11e1-b7de-52540004b857_20160310-002237/services/1/PullResults/

21/03/2016 http://inspire-geoportal.ec.europa.eu/resources/sandbox/INSPIRE-1b978242-062b-11e1-b7de-52540004b857_20160321-053225/services/1/PullResults/

27/03/2016 http://inspire-geoportal.ec.europa.eu/resources/sandbox/INSPIRE-1b978242-062b-11e1-b7de-52540004b857_20160326-180631/services/1/PullResults/

 

 

Additional notes on the Discovery Client Scenario:

The list of services inside the FederateCatalogues element of the National Discovery Service capabilities document, if not empty, is only used to

09 Jul 2018 17/25

restrict the list of Discovery Services that are actually harvested.

If the list is empty, all Linked Discovery Services are harvested.

The Geoportal can actually detect if the same Discovery Service has already been harvested but the method it uses is not officially described in theTechnical Guidelines.

Therefore it respects the scenario description and assumes that different metadata documents describe different resources.

From the Discovery Service Guidelines:

Discovery client approach:

A combination of: the Publish Metadata operation (CSWT Manager.Transaction or CSWT Manager.Harvest) for publishingmetadata; CSW.GetCapabilities (“FederatedCatalogues”) forretrieving federated Discovery Service endpoints and/or CSW.GetRecords for discovering Discovery Service metadata; and CSW.GetRecordsto discover dataset or dataset series metadata from remote locations in a further step following the use of the here described link discovery service

4.3.4.2 Discovery client approach

The discovery client scenario is based on the availability of information on available Discovery Service endpoints in a Member State’s DiscoveryService.

Implementation Requirement 15

Third Party Discovery Services shall be published in the Member State’s Discovery Service using the Publish Metadata operation.Third Party Discovery Services can additionally be published in the “FederatedCatalogues” section of the Discovery Service’s capabilitydocument if they are part of a federated search infrastructure.

 

 

#15 - 18 Apr 2016 02:57 pm - Manuel Illmeyer

Hi Angelo,

thank you for the uploaded files an the explanation.

So you are saying that we can add the discovery services, which are anyway harvested by our national csw, to a list of "Federate Catalogues" so thatthey wont be harvested again by the jrc geoportal? I think this could solve the problem of the changing count of metadata elements shown by the jrcgeoportal.

In Austria (almost) all discovery services are harvested by our national catalogue. We only have problems with one csw cause of curled brackets insidede fileIdentifier element (these are the 8 records from the "INSPIRE Dienst via donau"). If we can solve the problems with them, we wouldn't need toharvest any service from austria, except the national discovery service.

Since we are monitoring the getRecords response from our csw we didn't discover any significant changes in the count of records. Only a rise from 901to 906. But this is expected and explainable due to some changes of a data provider.

Best regards,

09 Jul 2018 18/25

Manuel

#16 - 18 Apr 2016 03:06 pm - Manuel Illmeyer

*edit*

As I read the TG now I unterstand that only the services in the FederateCatalogues list are harvested? The "viadonau-csw" is already promoted there.

#17 - 18 Apr 2016 03:40 pm - Angelo Quaglia

In the Discovery.CLIENT scenario usually all Discovery Services for which metadata exists, are recursively harvested by the Geoportal.

However, if the FederatedCatalogues element is not empty, only the services listed there are harvested.

As I also wrote above:

Additional notes on the Discovery Client Scenario:

The list of services inside the FederateCatalogues element of the National Discovery Service capabilities document, if not empty, is only used torestrict the list of Discovery Services that are actually harvested.

If the list is empty, all Linked Discovery Services are harvested.

 

In any case, I have now an explanation for the bigger fluctuations: the FederatedCatalogues constraint sometimes "disappears" from thecapabilties of the National Discvoery Service of Austria.

The INSPIRE Geoportal records all responses as they are received from the external services.

http://inspire-geoportal.ec.europa.eu/resources/sandbox/INSPIRE-1b978242-062b-11e1-b7de-52540004b857_20160205-183906/services/1/ger/downloaded name="FederatedCatalogues">http://inspire.via-donau.org/Geoportal/csw/discovery?Request=GetCapabilities&Service=CSW&Version=2.0.2 

However, 

http://inspire-geoportal.ec.europa.eu/resources/sandbox/INSPIRE-1b978242-062b-11e1-b7de-52540004b857_20160211-043222/services/1/ger/downloaded

There is no FederatedCatalogues constraint in the capabilities.

 

 

#18 - 18 Apr 2016 05:43 pm - Angelo Quaglia

Dear Manuel,

09 Jul 2018 19/25

as you can see from the table below there are two different kind of issues:

a) Variablity in the column "Records from Top Discovery Service"

This issue I understand you now have it under control

 

b) Variablity in the column "Records from linked Discovery services"

This one occurred for two different reasons:

12/02/2016 => Missing Federated Catalogues constraint

27/03/2016 =>Federated Catalogue did not respond

 

date report URL Records from Top Discovery Service

Records from linked Discovery services

issue

06/02/2016 http://inspire-geoportal.ec.europa.eu/resources/sandbox/INSPIRE-1b978242-062b-11e1-b7de-52540004b857_20160205-183906/services/1/PullResults/

891 8  

12/02/2016 http://inspire-geoportal.ec.europa.eu/resources/sandbox/INSPIRE-1b978242-062b-11e1-b7de-52540004b857_20160211-043222/services/1/PullResults/

918 2830 (1*) No FederatedCatalogues constraint found in the service capabilities.

21/02/2016 http://inspire-geoportal.ec.europa.eu/resources/sandbox/INSPIRE-1b978242-062b-11e1-b7de-52540004b857_20160220-181059/services/1/PullResults/

901 8  

06/03/2016 http://inspire-geoportal.ec.europa.eu/resources/sandbox/INSPIRE-1b978242-062b-11e1-b7de-52540004b857_20160305-185017/services/1/PullResults/

920 8  

10/03/2016 http://inspire-geoportal.ec.europa.eu/resources/sandbox/INSPIRE-1b978242-062b-11e1-b7de-52540004b857_20160310-002237/services/1/PullResults/

890 8  

21/03/2016 http://inspire-geoportal.ec.europa.eu/resources/sandbox/INSPIRE-1b978242-062b

09 Jul 2018 20/25

880 8  

-11e1-b7de-52540004b857_20160321-053225/services/1/PullResults/

27/03/2016 http://inspire-geoportal.ec.europa.eu/resources/sandbox/INSPIRE-1b978242-062b-11e1-b7de-52540004b857_20160326-180631/services/1/PullResults/

906 1608 issue (2*) - Federated Catalogue did not respond

 

(1*)http://inspire-geoportal.ec.europa.eu/resources/sandbox/INSPIRE-1b978242-062b-11e1-b7de-52540004b857_20160211-043222/services/1/ger/dowloaded

Missing Federated Catalogues constraint

 

(2*)http://inspire-geoportal.ec.europa.eu/resources/sandbox/INSPIRE-1b978242-062b-11e1-b7de-52540004b857_20160326-180631/services/1/federateCatalogues/resourceLocator1/discovery/PROBE-OGC_CSW202/err.xml

The resource url: "http://inspire.via-donau.org/Geoportal/csw/discovery?request=GetCapabilities&service=CSW&version=2.0.2" did notrespond within 33000ms

#19 - 19 Apr 2016 12:47 pm - Angelo Quaglia

Dear Manuel,

the INSPIRE Geoportal cannot do much if case (1* ) occurs again.

The problem with case (2*) occurs when all services listed in the FederatedCatalogues constraint fail to respond.

In that case the list is empty and the geoportal harvests all linked Discovery Services.

The point is that the Geoportal needs to contact each linked Discovery Service to check it does not redirect to the main Discovery Service.

I have to think about the best algorithm for this specific case but it will probably be something like:

If errors occurred during the analisys of the FederatedCatalogues and the final list of linked Discovery Services is empty, do not harvest any of them.

 

 

#20 - 19 Apr 2016 05:09 pm - Manuel Illmeyer

Dear Angelo,

09 Jul 2018 21/25

thanks again for the useful information.

We could find the problem for the 1* scenario. This was ozur backup discovery service. This service is running on a different infrastructure in adifferent organisation and serves as backup, if our service isn't available. We told them now to add the 1 service to the federated catalogues. So thisshouldn't happen again.

The 2* scenario is a bit of a problem to us, cause the discovery service listed in the federatedCatalogues section sometimes has maintenancedowntimes. In my opinion, if ther is a service entry in the federatedCatalogues section it shouldn#t harvest all CSW even if this one does not respond.

In fact I got an E-Mail today from the provider of the "INSPIRE CSW viaDonau" which told me that they have a downtime due to maintenance fromfriday (22.04.) evening to monday morning (25.04.). Is there anything we can do about this issue?

Thanks,

Manuel

#21 - 19 Apr 2016 07:10 pm - Angelo Quaglia

Dear Manuel,

many thanks for fixing issue 1.

 

Regarding issue 2, I agree with you in principle.

However, I must take into account that it is allowed and actually quite frequent for the federatedCatalogues section of a service, to contain theendpoint of the service itself. 

In that case, the federatedCatalogues section is not empty but the geoportal is supposed to harvest from all linked services.

If the check failed, the geoportal would not harvest aby linked service.

It is unlikely for this to happen, but I do not want to take the risk because the bahaviour would not be easy to understand.

Please note that I cannot rely on syntactical comparison of URLs, rather I have to contact each endpoint to be sure of which service is actuallyresponding at the other end.

I think that the safest approach is the following one:

Discover CLIENT scenario

if

   - the federated catalogues section is not empty

   - no service is found (excluding the top service itself)

   - at least one endpoint returned an error

then

09 Jul 2018 22/25

   fail the harvesting operation

end if

This would mean that during the maintenance of "INSPIRE CSW viaDonau" the content of Austrian metadata would not get refreshed.

The only workaround would be to add a placeholder CSW endpoint that responds as a CSW but returns no records.

In this way the Geoportal would know that the intention is not to harvest all linked services.

 

 

 

 

 

 

#22 - 20 Apr 2016 08:59 am - Manuel Illmeyer

OK, thanks again.

I will discuss the option with the empty csw internally and write back to you as soon as possible.

#23 - 20 Apr 2016 04:26 pm - Angelo Quaglia

Another option is that I simply override the scenario from CLIENT to CENTRALISED during the maintenance days.

#24 - 25 Apr 2016 10:33 am - Manuel Illmeyer

Sorry, i was out of office. 

The viadonau services should be working again. The long-term solution for Austria is the centralised scenario, but we have to fix our harvestingproblems first. We will switch to the newest geonetwork version in near future, which should solve the issue with the curled brackets.

#25 - 04 May 2016 09:23 am - Manuel Illmeyer

Until the implementation of the new geonetwork in Austria, I will inform you if there are maintenance days for the via Donau CSW again, so you canswitch the scenario during this time.

#26 - 04 May 2016 07:44 pm - Angelo Quaglia

OK, that's fine for me, many thanks.

09 Jul 2018 23/25

#27 - 23 Aug 2016 04:21 pm - Angelo Quaglia- Subject changed from Austria - Land-, forst- und wasserwirtschaftliches Rechenzentrum GmbH: Observed reduction in the number of metadata records to Austria - LFRZ: Observed reduction in the number of metadata records- Submitting Organisation changed from Austria - Land-, forst- und wasserwirtschaftliches Rechenzentrum GmbH to Austria - LFRZ

#28 - 23 Aug 2016 04:22 pm - Angelo Quaglia- Subject changed from Austria - LFRZ: Observed reduction in the number of metadata records to AT - LFRZ: Observed reduction in the number of metadata records- Submitting Organisation changed from Austria - LFRZ to AT - LFRZ

#29 - 24 Aug 2016 06:24 pm - Angelo Quaglia- Category set to Harvesting results

#30 - 21 Sep 2016 06:26 pm - Angelo Quaglia- Country set to AT - Austria

#31 - 13 Apr 2017 10:26 am - Angelo Quaglia- Proactive set to No

Dear Manuel,

the service https://geometadaten.lfrz.at/at.lfrz.discoveryservices/srv/de/csw202declares 926 records:Resources available for discovery: 926, Expected Resource Count: 926, Actual Resource Count : 926

http://www.opengis.net/cat/csw/2.0.2" xmlns:xsi="http://www.w3.org/2001/XMLSchema-instance" xsi:schemaLocation="http://www.opengis.net/cat/csw/2.0.2http://schemas.opengis.net/csw/2.0.2/CSW-discovery.xsd">        926" numberOfRecordsReturned="1" elementSet="full" nextRecord="2">

 

Yesterday, 928:Result of the interaction with the Discovery ServiceResources available for discovery: 928, Expected Resource Count: 928, Actual Resource Count : 928

 

Could you please tell me what is the expected number of records?

Best regards,

Angelo

 

 

#32 - 13 Apr 2017 10:27 am - Angelo Quaglia- Proactive changed from No to Yes

09 Jul 2018 24/25

#33 - 13 Apr 2017 11:28 am - Manuel Illmeyer

Dear Angelo,

the expected number of records at the moment is 926.

Yesterday we had a harvest rund from our subordinary discovery services. Maybe one of the federal provinces depublished metadata records.

Kind regards,

Manuel

#34 - 13 Apr 2017 11:30 am - Angelo Quaglia- Status changed from Feedback to Resolved

Dear Manuel,

thanks you for your fast feedback.

I am therefore promoting the latest harvesting results to production.

Best regards,

Angelo

Files20160408.zip 2.35 MB 08 Apr 2016 Angelo Quaglia

09 Jul 2018 25/25


Recommended