...
From the Primary, we can run the discovery using the CLI tool with debug enabled to get further information:
Code Block
bin/opha-cli.pl act=discover url_base=http://poller username=xxxxxx password=xxxxxx debug=9
...
Please note that if the server already has nodes, they should be exported and imported again with localise_ids once the cluster_id is changed, as the node information won't have the same cluster_id attribute and the nodes will be treated as remote nodes (which cannot be edited or polled, for example).
Code Block
localise_ids=true
(default: false) - when true, the cluster id is rewritten to match the local NMIS installation
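For example, the export/import round trip might look like the sketch below. This is a hedged illustration only: it assumes a standard NMIS 9 install under /usr/local/nmis9 and the node_admin.pl tool; the file path is a placeholder, and you should confirm the exact act= and localise_ids options against the usage output of node_admin.pl on your version.

```
# hypothetical sketch - verify options with node_admin.pl usage output first
# export all nodes to a file, then re-import them with localised cluster ids
/usr/local/nmis9/admin/node_admin.pl act=export file=/tmp/all_nodes.json
/usr/local/nmis9/admin/node_admin.pl act=import file=/tmp/all_nodes.json localise_ids=true
```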
After the change, the omkd daemon needs to be restarted.
...
opHA uses a username/password to access the registry data from the poller, but once the poller has been discovered, it uses a token for authentication. So the authentication method "token" must be enabled on the poller.
Check that in <omk_dir>/conf/opCommon.json we have the following (where X is 1, 2 or 3; the order does not matter):
...
Also, the property auth_token_key should be set in the poller configuration.
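For reference, the relevant poller entries in opCommon.json might look like the fragment below. The method names follow the standard auth_method_X convention; the auth_token_key value shown is a placeholder you must replace with your own shared secret (it should match across the cluster):

```
"auth_method_1" : "token",
"auth_method_2" : "htpasswd",
"auth_token_key" : "your-shared-secret-here",
```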
More information is available in the opHA authorisation/authentication documentation.
Other causes
Another potential cause of a 401 error is the Linux system clock on the poller being too far out of sync with the Primary. In this case, you may see an error similar to the one below, where auth.log shows an info message that the user's token has expired. Ensuring the clocks are in sync will resolve this issue.
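The skew check can be sketched as below. This simulates a poller clock 400 seconds behind the primary; in practice you would fetch the poller's epoch remotely (e.g. `ssh poller 'date +%s'`), and the 300-second threshold is an illustrative assumption, not the exact token tolerance:

```shell
# simulate a poller whose clock is 400s behind the primary
primary_epoch=$(date +%s)
poller_epoch=$((primary_epoch - 400))   # in practice: ssh poller 'date +%s'
skew=$((primary_epoch - poller_epoch))
skew=${skew#-}                          # absolute value
if [ "$skew" -gt 300 ]; then
  echo "clock skew ${skew}s - auth tokens may be rejected as expired (401)"
fi
```

If the skew is large, fix it with your normal time synchronisation tooling (chrony or ntpd) on both servers.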
...
From the Primary, we can initiate discovery of a peer using the URL https://servername (using SSL/TLS).
...
This can be set in <omk_dir>/conf/opCommon.json on the poller:
Code Block
"opha_url_base" : "https://servername.domain.com",
If we set the URL to https://servername in the discovery, the poller will send its registry data to the Primary, and the Primary will get the correct url_base for the peer from that information.
If opha_url_base is blank, the Primary will swap the https:// URL for http://.
Verify Hostname and URL Base configuration
To verify that the url_base and hostname values are correct, run:
Code Block
grep -E "(opha|opevents|opflow|opcharts|opconfig|opflowsp|opreports|omkd)_(hostname|url_base)" /usr/local/omk/conf/opCommon.json
The expected result is that the omkd and opha values are set, but the others are not:
Code Block
"opflowsp_url_base" : "",
"opflowsp_hostname" : "",
"opflow_opcharts_url_base" : "",
"opflow_url_base" : "",
"opflow_hostname" : "",
"opcharts_url_base" : "",
"opcharts_hostname" : "",
"opconfig_hostname" : "",
"opconfig_url_base" : "",
"omkd_url_base" : "",
"opevents_hostname" : "",
"opevents_url_base" : "",
"opreports_url_base" : "",
"opreports_hostname" : "",
"opreports_opcharts_url_base" : "http://127.0.0.1:8042",
"opha_hostname" : "lab-ms-primary",
"opha_url_base" : "https://lab-ms-primary.opmantek.net",
Some data is not updated in the Primary
opHA has a new feature to synchronise only the data that has been added or modified since the last synchronisation. If some data is not being updated, we can force a synchronisation, adding parameters to update only the required data types and nodes:
Code Block
bin/opha-cli.pl act=pull data_types=[nodes|latest_data|...] peers=[nodeNames] force=t
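As a concrete instance of the syntax above, forcing a pull of only the nodes data type from a single poller might look like the following (the peer name poller01 is a placeholder for one of your poller names):

```
bin/opha-cli.pl act=pull data_types=nodes peers=poller01 force=t
```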
...
Different situations have been identified as causing this issue:
- If the same node name exists in more than one poller and the configuration item opevents_auto_create_nodes is true, a new local node will be created on the primary server. This is because an event is identified only by its node name, and the primary cannot choose which of the remote nodes to assign the event to.
- If there are two Main Primary servers: this situation can cause chaos in the environment, as both primaries will change the nodes on the pollers.
- Also, if some catchall data is duplicated in the primary, some nodes will appear as duplicates in opCharts.
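For the first cause above, a quick way to spot node names that exist on more than one poller is to compare per-poller name lists. This is only a sketch: the two files below stand in for node-name lists you would extract from each poller's node export, and the names are invented examples.

```shell
# stand-in node-name lists for two pollers (normally extracted from node exports)
printf 'router1\nswitch2\n'   > /tmp/poller_a_nodes.txt
printf 'router1\nfirewall3\n' > /tmp/poller_b_nodes.txt

# names printed here exist on more than one poller
sort /tmp/poller_a_nodes.txt /tmp/poller_b_nodes.txt | uniq -d
```

Here the command prints router1, the name shared by both pollers.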
...
Premature Connection Close
The web server closed the connection before the user agent could receive the whole response, or the user agent was destroyed, which forces all connections to be closed immediately.
We can tune the following values to prevent this error from happening:
In the primary server
If the poller is taking too long to respond, we can increase the value of omkd_inactivity_timeout in <omk_dir>/conf/opCommon.json.
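For example, the entry might look like the fragment below. The value 120 is only an illustrative choice (in seconds); pick one that suits your environment:

```
"omkd_inactivity_timeout" : "120",
```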
...
If the request is taking too long, we can decrease the number of elements sent for each data type.
...
Restarting omkd is required after changing this parameter.
Error performing operation - Error getting remote data model
There are several issues which can result in the error message "Error getting remote data model" in the GUI; these relate to HTTP/HTTPS connectivity and authorisation settings on the primary and poller servers.
Connection error: Connection Refused.
You might be seeing an error in the GUI as follows:
Ensure that the opHA API user is configured to be the same as in the peer setup. The user should exist in the NMIS Users.nmis file and have permissions assigned; by default this is set to omkapiha. Check <omk_dir>/conf/opCommon.json:
Code Block
"opha_api_user": "omkapiha",
After changing it, restart the daemon.
Code Block
systemctl restart omkd
Note: This error can also occur if you upgrade opHA and do not accept the EULA on the pollers. Double check the status of the pollers from the main opHA dashboard on the primary.
Connection error: SSL connect attempt failed error
The error in the GUI would be as follows:
In this case, the SSL certificate is likely signed by a local certificate authority (CA), or you might be using self-signed SSL certificates; in either case you will need to let the applications know this is OK.
On the primary server, set the following configuration option in the opHA section of <omk_dir>/conf/opCommon.json:
Code Block
"opha_allow_insecure" : "1",
You may also need to enable the following on the primary server:
Code Block
"omk_ua_insecure" : "1",
After changing it, restart the daemon.
Code Block
systemctl restart omkd
Connection error: there is an authorization problem
In the GUI, you observe the following error:
You should ensure that the opHA API user that is defined in opCommon.json on both the Primary/Main Primary and the poller(s) is the same user, and that this user exists in the Users.nmis table. By default the configured user is "omkapiha".
Code Block
"opha_api_user" : "omkapiha",
Teapot error: error saving node to remote
In the GUI, you observe the following error:
Check /usr/local/omk/log/opDaemon.log and see whether it contains the following lines:
Code Block
[debug] current_app_log: bad log, application_key missing
[error] NodeData::update_resource Error creating node in remote. Reason: 418 I'm a teapot
[debug] 418 I'm a teapot (0.127757s, 7.827/s)
Validate that the pollers and the primary have the same values set in nmis9/conf/Config.nmis for each of the following: 'nodetype_list', 'nettype_list', 'roletype_list'.
An easy way to do this is using the patch_config.pl tool:
Code Block
/usr/local/nmis9/admin/patch_config.pl -r /usr/local/nmis9/conf/Config.nmis roletype_list
/usr/local/nmis9/admin/patch_config.pl -r /usr/local/nmis9/conf/Config.nmis nettype_list
/usr/local/nmis9/admin/patch_config.pl -r /usr/local/nmis9/conf/Config.nmis nodetype_list
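To compare the primary's values against a poller's, you can diff the captured outputs. The sketch below uses stand-in files with invented list values; in practice you would capture each server's patch_config.pl output (e.g. over SSH) into those files first:

```shell
# stand-in captures of patch_config.pl output from each server
printf "'default','router','switch'\n" > /tmp/primary_roletype_list
printf "'default','router','switch'\n" > /tmp/poller_roletype_list

if diff -q /tmp/primary_roletype_list /tmp/poller_roletype_list >/dev/null; then
  echo "roletype_list matches"
else
  echo "roletype_list differs - update, restart daemons, rediscover"
fi
```

With identical inputs, as here, this prints "roletype_list matches"; repeat per list key.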
If they are mismatched, update them, restart the daemons (nmis9d and omkd), and then rediscover the poller.