summaryrefslogtreecommitdiffstats
path: root/plugins
Commit message (Collapse)AuthorAgeFilesLines
* increase byte size in socket.recev() in livestatusHEADmasterRamesh Nachimuthu2016-09-181-2/+2
| | | | | | | | | | | | | Currently we are only retriving 8192 bytes of data from livestatus for any livestatus query. This is not enough for some cases where we have huge number of volumes and bricks. So increasing the byte size to 1000000. Change-Id: Icd43c158bbb6ac4e233bd5dcbf6ce66bdf2335d5 Bug-Url: https://bugzilla.redhat.com/1351749 Signed-off-by: Ramesh Nachimuthu <rnachimu@redhat.com> Reviewed-on: http://review.gluster.org/15510 Reviewed-by: Sahina Bose <sabose@redhat.com>
* nagios-plugins: Added heal info monitoringSahina Bose2016-03-242-4/+26
| | | | | | | | | | | | | | Added a plugin to monitor heal info for a volume Corrected pep8 errors Changed default monitoring interval to 1 min for heal info monitoring Bug-Url: https://bugzilla.redhat.com/1267586 BUG: 1267586 Change-Id: I1c51d4d0f2129a7bb19f44b63382fcfc7384dc40 Signed-off-by: Sahina Bose <sabose@redhat.com> Reviewed-on: http://review.gluster.org/12262 Reviewed-by: Ramesh N <rnachimu@redhat.com>
* nagios-plugins: fixing quorum pluginSahina Bose2015-11-301-8/+7
| | | | | | | | | | | Fixing the quorum service plugin flipping to OK state when the servers are down - due to wrong query of services BUG: 1284874 Change-Id: I5b6201f430778b92bd30c95fe3c37f4d89240eb8 Signed-off-by: Sahina Bose <sabose@redhat.com> Reviewed-on: http://review.gluster.org/12735
* server-addons: Added logic to determine vol status of disperse-volDarshan N2015-08-161-37/+54
| | | | | | | | | | | | | This patch adds logic to check_vol_server plugin to determine the status of disperse and distributed dispersed volume. With this patch volume status of dispersed and distributed volume are correctly determined. Change-Id: I4c0ee4a63c00201127fe50be424c9f9d3fa5300d Bug-Url: https://bugzilla.redhat.com/1235651 Signed-off-by: Darshan N <dnarayan@redhat.com> Reviewed-on: http://review.gluster.org/11503 Reviewed-by: Sahina Bose <sabose@redhat.com>
* server-addons: pep8 fixes.Darshan N2015-08-163-9/+9
| | | | | | | Change-Id: I920ed33f0e257dd741d1c6df39b46efebe9abd02 Signed-off-by: Darshan N <dnarayan@redhat.com> Reviewed-on: http://review.gluster.org/11836 Reviewed-by: Sahina Bose <sabose@redhat.com>
* server-plugin: To check if any vol has server side quorumv1.1.0Sahina Bose2015-05-312-3/+30
| | | | | | | | | | | | | | | | Added a server side plugin to change status from Pending to OK/UNKNOWN depending on if volumes have server side quorum turned on Renamed service to ensure service definition gets updated on upgrades. Bug-Url: https://bugzilla.redhat.com/show_bug.cgi?id=1106421 Change-Id: I1f1ca1dcf56872342ec8e25396a89163d22aedeb Signed-off-by: Sahina Bose <sabose@redhat.com> Reviewed-on: http://review.gluster.org/8023 Reviewed-by: darshan n <dnarayan@redhat.com> Reviewed-by: Ramesh N <rnachimu@redhat.com>
* nagios-config: Rename self-heal status pluginSahina Bose2015-05-281-1/+1
| | | | | | | | | | Renaming self-heal status plugin to split-brain status Change-Id: Icdb934904eda9c47ccdc4ba41a9204048918f0a9 Bug-Url: https://bugzilla.redhat.com/1221872 Signed-off-by: Sahina Bose <sabose@redhat.com> Reviewed-on: http://review.gluster.org/10908
* autoconf: validate the nagios server address in auto-configRamesh Nachimuthu2015-05-193-17/+101
| | | | | | | | | | | | | | | | | | | | | Currently Nagios server address entered by the user during auto config was not being verified. This patch helps to verify the address entered by the user. If IP address is given as the nagios server address then it checks the pattern and verifies that it is mapped to one of the non loopback device in the host If user enters fqdn name, then it tries to resolve it, also it verifies that resolved IP address maps to one of the non loopback device in the host. Bug-Url: https://bugzilla.redhat.com/1127657 Change-Id: I88d67cc6d8fa05f2934922fbc0d8e757b1d73e43 Signed-off-by: Ramesh Nachimuthu <rnachimu@redhat.com> Reviewed-on: http://review.gluster.org/7740 Reviewed-by: darshan n <dnarayan@redhat.com> Reviewed-by: Sahina Bose <sabose@redhat.com>
* plugins: Fixed issue with str comparisonv1.0.0Sahina Bose2014-12-021-2/+2
| | | | | | | | | | | | Wrongly used str.contains in python. Fixed this Change-Id: Idea139329dc430294fc016b30db2578335bb3666 Bug-Url: https://bugzilla.redhat.com/1109752 Signed-off-by: Sahina Bose <sabose@redhat.com> Reviewed-on: http://review.gluster.org/9226 Reviewed-by: Ramesh N <rnachimu@redhat.com> Reviewed-by: kasturi narra <kasturi.narra@gmail.com> Reviewed-by: Kanagaraj M <kmayilsa@redhat.com>
* plugins: Handle volume locked errorsSahina Bose2014-11-251-2/+8
| | | | | | | | | | | | | | | When plugin returns unknown status due to transaction in progress, the plugin should try again after some time to avoid propogating this error to Nagios Change-Id: I83fa7c3ec7382d1263e36995662df83b5ae39aa6 Bug-Url: https://bugzilla.redhat.com/show_bug.cgi?id=1109752 Signed-off-by: Sahina Bose <sabose@redhat.com> Reviewed-on: http://review.gluster.org/9192 Reviewed-by: Ramesh N <rnachimu@redhat.com> Reviewed-by: Kanagaraj M <kmayilsa@redhat.com> Reviewed-by: Bala FA <barumuga@redhat.com>
* nagios-server-addons:_executeRandomHost to check for the status of glusterd ↵Nishanth Thomas2014-11-171-2/+15
| | | | | | | | | | | | | | | | | | | service During the creation of eligible list of hosts, need to consider the status of glusterd service also. this is included. Also added some additional checks to handle the errors https://bugzilla.redhat.com/show_bug.cgi?id=1136205 https://bugzilla.redhat.com/show_bug.cgi?id=1109843 Signed-off-by: Nishanth Thomas <nthomas@redhat.com> Change-Id: Ic757ecc0df5ef3c261048a961af59715ca889a26 Reviewed-on: http://review.gluster.org/9066 Reviewed-by: Ramesh N <rnachimu@redhat.com> Tested-by: Nishanth Thomas <nishusemail@gmail.com> Reviewed-by: Shubhendu Tripathi <shtripat@redhat.com> Reviewed-by: Sahina Bose <sabose@redhat.com>
* autoconf: unregister the temp_node1 instead of deleting it.Ramesh Nachimuthu2014-11-171-13/+14
| | | | | | | | | | | | | | | | | | | | | | | Unregistering the temp_node1 instead of deleting it while configuring nagios . new field 'register' will be added to temp_node1 and its services with value 0, so that it won't be considered by nagios. If both files, temp_node1.cfg and temp_node1.cfg.sample are present during upgrade then we will remove the file temp_node1.cfg.sample and add the 'register 0' flag to definitions in temp_node1.cfg file. Bug-Url: https://bugzilla.redhat.com/1162446 Bug-Url: https://bugzilla.redhat.com/1162443 Bug-Url: https://bugzilla.redhat.com/1115387 Change-Id: I3c76d6c6a9a9e411315859e07238a7902ddd2380 Signed-off-by: Ramesh Nachimuthu <rnachimu@redhat.com> Reviewed-on: http://review.gluster.org/9123 Reviewed-by: Kanagaraj M <kmayilsa@redhat.com> Reviewed-by: Sahina Bose <sabose@redhat.com>
* server-plugin: consider all status of volume in cluster statusRamesh Nachimuthu2014-11-111-2/+22
| | | | | | | | | | | | | | | | | | | | | | Consider the 'UNKNOWN' status of volumes while calculating the cluster status. Following will be the cluster state and state information. Cluster State State Information UP "OK : None of the Volumes in the cluster are in Critical State" UP "OK : No Volumes present in the cluster" UP "WARNING : Some Volumes in the cluster are in Critical State" UP "WARNING : Some Volumes in the cluster are in Unknown State" UP "WARNING : Some Volumes in the cluster are in Warning State" UP "WARNING : All Volumes in the cluster are in Warning State" DOWN "CRITICAL: All Volumes in the cluster are in Critical State" DOWN "CRITICAL: All Volumes in the cluster are in Unknown State" Bug-url: https://bugzilla.redhat.com/1128007 Change-Id: I06fb1697cb4919420ab6d6ea54e9d9ee96820d2a Signed-off-by: Ramesh Nachimuthu <rnachimu@redhat.com> Reviewed-on: http://review.gluster.org/9053 Reviewed-by: Shubhendu Tripathi <shtripat@redhat.com> Reviewed-by: Sahina Bose <sabose@redhat.com>
* autoconf: fix the error message for duplicate hostRamesh Nachimuthu2014-10-291-1/+1
| | | | | | | | | | | Modified the error message for the case when there is already a host with same name existing. Change-Id: I2e6b5ef80baa0a45d94f069980ac81b1fbd8c058 Bug-Url: https://bugzilla.redhat.com/1112201 Signed-off-by: Ramesh Nachimuthu <rnachimu@redhat.com> Reviewed-on: http://review.gluster.org/8658 Reviewed-by: Sahina Bose <sabose@redhat.com>
* autoconfig: avoid renaming hostnames in nagiosRamesh Nachimuthu2014-10-292-0/+19
| | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | Auto Config tries to use the output of 'hostname' command as a host_name for node in nagios. But if the hostnames are not unique, then it uses IP Address as node_name name in nagios. If after removal of a node, hostnames becomes unique then auto config renames all the host configuration in nagios to their hostnames instead of ip address. It creates multiples issues like removing all the nodes, loss of time series data and unnecessary confusion to the user. Fixing this issue by retaining the perviously configured host names always by going through the host uuid. So if duplication comes after a node addition then only that node will be configured with ip address as host_name but all other existing hosts will not be touched. Similary, even though there is no duplication after the removal of a node with duplicate name, auto config will retain the previously configured hostnames (which are ip addresss becasue there was a hostname duplication earlier) instead of renaming. Change-Id: I0abef47e2e495a591dd3901d2b1766a04c59a701 Bug-Url: https://bugzilla.redhat.com/1139228 Bug-Url: https://bugzilla.redhat.com/1139505 Signed-off-by: Ramesh Nachimuthu <rnachimu@redhat.com> Reviewed-on: http://review.gluster.org/8699 Reviewed-by: darshan n <dnarayan@redhat.com> Reviewed-by: Sahina Bose <sabose@redhat.com>
* nagios-server-addons:executeRandomHost to send request with correct host addressNishanth Thomas2014-10-131-5/+9
| | | | | | | | | | | | | | | When the inital request fails, executeRandomHost is supposed to iterate over the list of avialable hosts and resend the request until get a successful response. This part of code was falty, where the request was send without getting the proper ipaddress/hostname Change-Id: I1f1c7a5bccba3bd601c9e3241083784f15a2c4b6 Bug-url: https://bugzilla.redhat.com/show_bug.cgi?id=1109843 Signed-off-by: Nishanth Thomas <nthomas@redhat.com> Reviewed-on: http://review.gluster.org/8101 Reviewed-by: Sahina Bose <sabose@redhat.com> Tested-by: Nishanth Thomas <nishusemail@gmail.com>
* autoconf: verify the configurations before restarting nagiosRamesh Nachimuthu2014-09-153-8/+30
| | | | | | | | | | | | | Verify the Nagios configuration before restarting the nagios server. It helps in accidently stoping the nagios server because of config error. Change-Id: I8c7cb5663bf42d73812577e9b162434cb2ecfdfe Bug-Url: https://bugzilla.redhat.com/1138943 Signed-off-by: Ramesh Nachimuthu <rnachimu@redhat.com> Reviewed-on: http://review.gluster.org/8131 Reviewed-by: Shubhendu Tripathi <shtripat@redhat.com> Reviewed-by: Sahina Bose <sabose@redhat.com>
* server-addons: Fix typo in error messageRamesh Nachimuthu2014-07-021-1/+1
| | | | | | | | | | Fixing typo 'NPRE' in server_utils.execNRPECommand() Bug-Url: https://bugzilla.redhat.com/1108569 Change-Id: I9b29780661353eb0b82ef24a390c2d2091f7ea4e Signed-off-by: Ramesh Nachimuthu <rnachimu@redhat.com> Reviewed-on: http://review.gluster.org/8130 Reviewed-by: Sahina Bose <sabose@redhat.com>
* fix pep8 errorsTimothy Asir Jeyasingh2014-06-265-115/+116
| | | | | | | | Change-Id: Ia0f570dfb74d6c41484e509f704df12d3b96db17 Signed-off-by: Timothy Asir Jeyasingh <tjeyasin@redhat.com> Reviewed-on: http://review.gluster.org/8161 Tested-by: Timothy Asir <tim.gluster@gmail.com> Reviewed-by: Shubhendu Tripathi <shtripat@redhat.com>
* autoconfig: fixing issue in finding duplicate hostsRamesh Nachimuthu2014-06-161-4/+6
| | | | | | | | | | | | | | When a host is in disconnected status, we will not communicate with the host and will not be able to get the hostname. So we don't need to consider those disconnected hosts in finding duplicate hosts. Change-Id: Ie3fbef03635d46fd9d95e5b1f4df7435e800c55d Bug-Url: https://bugzilla.redhat.com/1109723 Signed-off-by: Ramesh Nachimuthu <rnachimu@redhat.com> Reviewed-on: http://review.gluster.org/8074 Reviewed-by: Sahina Bose <sabose@redhat.com> Tested-by: Sahina Bose <sabose@redhat.com>
* nagios-server-addons:Fixed issues with executeRandomHostNishanth Thomas2014-06-161-6/+12
| | | | | | | | | | | | | | | When a node in the cluster is down, nrpe request sent to the node gets timed out resulting a failure in executeRandomHost. This is fixed by selecting only those nodes which are UP Change-Id: I5b22dcb30c0e2a006a549dc642b16db278c9c0f1 Bug-Url: https://bugzilla.redhat.com/show_bug.cgi?id=1109025 Signed-off-by: Nishanth Thomas <nthomas@redhat.com> Reviewed-on: http://review.gluster.org/8061 Reviewed-by: Ramesh N <rnachimu@redhat.com> Reviewed-by: Kanagaraj M <kmayilsa@redhat.com> Tested-by: Nishanth Thomas <nishusemail@gmail.com> Reviewed-by: Sahina Bose <sabose@redhat.com>
* autoconfig: Handle sync host deletion in auto config.Ramesh Nachimuthu2014-06-111-0/+15
| | | | | | | | | | | | | | | | | Auto config uses a user configured host to periodically sync the gluster configuration. If the host used for sync itself is removed from the cluster, then auto config will detect this as all other hosts are getting removed from cluster. Fixing this issue by not removing all the hosts except sync host in auto mode. User has to run the auto config manually with differnt host to remove sync host. Change-Id: I18fa51b4ed9674264ea8b2d2f615f7680a59c04e Bug-Url: https://bugzilla.redhat.com/1107998 Signed-off-by: Ramesh Nachimuthu <rnachimu@redhat.com> Reviewed-on: http://review.gluster.org/8024 Reviewed-by: Sahina Bose <sabose@redhat.com>
* nagios-server-addons:volume status based on volume typeNishanth Thomas2014-06-041-12/+102
| | | | | | | | | | | | | | | | Added the volume type in the plugin output The logic of determining the volume status changed based on the volume type.Added the volume type and the list of bricks down in the plugin output Change-Id: Ib8d3111bdcc04264ec8bb6383fcb4fad97a17bab Bug-Url:https://bugzilla.redhat.com/show_bug.cgi?id=1096159 Bug-Url:https://bugzilla.redhat.com/show_bug.cgi?id=1096169 Signed-off-by: Nishanth Thomas <nthomas@redhat.com> Reviewed-on: http://review.gluster.org/7874 Reviewed-by: Kanagaraj M <kmayilsa@redhat.com> Reviewed-by: Sahina Bose <sabose@redhat.com> Tested-by: Nishanth Thomas <nishusemail@gmail.com>
* nagios-server-addons: Fix for dummy config file issue.ndarshan2014-06-031-0/+13
| | | | | | | | | | | | | | | | | | | This patch replaces the configfile node1.cfg(gluster-host) by temp_node1.cfg which is of type linux-server having service PING. And this dummy config file is renamed to temp_node1.cfg.sample during auto-discovery as it is not needed after we configure actual hosts. Also this patch sets the config variable allow_empty_hostgroup_assignment to 1, which is needed for nagios to allow declaration of empty host-group with services. Change-Id: I03cc54975f5ff5857f8d82b66110e049e4195009 Bug-Url: https://bugzilla.redhat.com/show_bug.cgi?id=1091236 Signed-off-by: ndarshan <dnarayan@redhat.com> Reviewed-on: http://review.gluster.org/7952 Reviewed-by: Kanagaraj M <kmayilsa@redhat.com> Reviewed-by: Shubhendu Tripathi <shtripat@redhat.com> Reviewed-by: Sahina Bose <sabose@redhat.com>
* nagios-server-addons: Check hosts status using NRPEShubhendu Tripathi2014-05-301-77/+22
| | | | | | | | | | | | | Enabled to execute check_nrpe for a given host to check its status. Change-Id: I938b78fcbf52cd46f4f493e2c2b8b927614834eb Bug-URL: https://bugzilla.redhat.com/show_bug.cgi?id=1102506 Signed-off-by: Shubhendu Tripathi <shtripat@redhat.com> Reviewed-on: http://review.gluster.org/7923 Reviewed-by: Ramesh N <rnachimu@redhat.com> Reviewed-by: Kanagaraj M <kmayilsa@redhat.com> Reviewed-by: Timothy Asir <tim.gluster@gmail.com>
* server-plugins: Corrected cluster utilization when volume stoppedSahina Bose2014-05-291-13/+26
| | | | | | | | | | Corrected the message returned by plugin when all volumes in cluster are stopped Change-Id: Ie5e0fbbb10fa7d2787a82e1caa7ca86e8c7010a3 Bug-Url: https://bugzilla.redhat.com/1096730 Signed-off-by: Sahina Bose <sabose@redhat.com> Reviewed-on: http://review.gluster.org/7780
* autoconfig: refactoring and adding test casesRamesh Nachimuthu2014-05-282-43/+50
| | | | | | | | | | | | Refactoring the code to use constants for repeatedly used strings and add some test case. Change-Id: I7c88231a51ceea3b1cc94223018ecc704b657512 Bug-Url: https://bugzilla.redhat.com/1099093 Bug-Url: https://bugzilla.redhat.com/1099328 Signed-off-by: Ramesh Nachimuthu <rnachimu@redhat.com> Reviewed-on: http://review.gluster.org/7883 Reviewed-by: Sahina Bose <sabose@redhat.com>
* autoConfig:Support for force syncing specific fields in service configRamesh Nachimuthu2014-05-282-16/+42
| | | | | | | | | | | | | | | | | | | Currently Auto-config won't update any service configurations in nagios. It always adds or deletes services configurations but no update. With the following bugs we have the need to update few values in service configuration. Hence adding the support to update specific fields in service config. All the fields in the list config_generator.SERVICE_FIELDS_TO_FORCE_SYNC will be force updated during auto configuration. With this patch only '_VOL_NAME' and 'notes' field will be updated. Change-Id: I6003ade375d2f0d6e7dee97ef05f7a9af4ce6ffd Bug-Url: https://bugzilla.redhat.com/1099093 Bug-Url: https://bugzilla.redhat.com/1099328 Signed-off-by: Ramesh Nachimuthu <rnachimu@redhat.com> Reviewed-on: http://review.gluster.org/7868 Reviewed-by: Sahina Bose <sabose@redhat.com>
* server-plugin: Corrected the exit status for pluginSahina Bose2014-05-281-2/+1
| | | | | | | | | | | | | | The exit status from NRPE plugin was wrapped with WEXITSTATUS which was earlier used when running commands directly. With the subprocess implementation, this is not required as it always returns 0 for successful execution even when nrpe plugin returns 1/2. Change-Id: I3cda8d4aa5b4f726cb5c7faf05c7f2536115eafe Bug-Url: https://bugzilla.redhat.com/show_bug.cgi?id=1101211 Signed-off-by: Sahina Bose <sabose@redhat.com> Reviewed-on: http://review.gluster.org/7905
* server-addons: fix for hostgroup issue in brick status event handlerRamesh Nachimuthu2014-05-272-0/+66
| | | | | | | | | | | | | | | | | Macro "$HOSTGROUPNAME$" is used in brick status event handler. But this macro gives only one of the host group to which the host belongs to. But in gluster monitoring, all gluster hosts will have two host groups ('gluster-host' and a host group with cluster name). So using the macro "$HOSTGROUPNAMES$" to pass all the host groups to event handler and internally getting the currect cluster name. Change-Id: I61713ecabff52bcd7f585e9f678426370b9b24d4 Signed-off-by: Ramesh Nachimuthu <rnachimu@redhat.com> Reviewed-on: http://review.gluster.org/7761 Reviewed-by: Nishanth Thomas <nishusemail@gmail.com> Reviewed-by: Kanagaraj M <kmayilsa@redhat.com> Reviewed-by: Timothy Asir <tim.gluster@gmail.com>
* autoConfig: create quota/geo rep services conditionallyRamesh Nachimuthu2014-05-231-10/+12
| | | | | | | | | | | Create QUOTA and GEO Replication status services for volume only when respective services are enabled in gluster. Bug-Url: https://bugzilla.redhat.com/1099731 Change-Id: I9813a64f7d2573aaf3b23534643662e198b5457d Signed-off-by: Ramesh Nachimuthu <rnachimu@redhat.com> Reviewed-on: http://review.gluster.org/7838 Reviewed-by: Sahina Bose <sabose@redhat.com>
* autoconfig: delete only auto generated configs in auto-configRamesh Nachimuthu2014-05-232-0/+6
| | | | | | | | | | | | | | | | Currently auto-config removes all the custom services and hosts defined by user. So with this issue, user can't add any extra services on top of services generated by auto-config. Adding a custom filed '__GENERATED_BY_AUTOCONFIG' as part of all the services added by auto-config. Only services with the above custom field will be deleted by auto-config when respective volume/brick/ is removed in gluster. Change-Id: I85d3d971d3b99f52ff603044f7f974746e247737 Signed-off-by: Ramesh Nachimuthu <rnachimu@redhat.com> Reviewed-on: http://review.gluster.org/7750 Reviewed-by: Sahina Bose <sabose@redhat.com>
* autoconf:check if host is already being monitoredRamesh Nachimuthu2014-05-221-0/+20
| | | | | | | | | | | | | | | User may have a host configuration defined already for the host being discovered by auto config. So checking if the host already exist as part of another cluster before auto configuring. Same issue may happen when user gives a differnt cluster name in autoconfig after running for the first time. Change-Id: Ia3dbf8e4895cfe4d24e99eda49313b44dbd25aef Bug-Url: https://bugzilla.redhat.com/1085320 Signed-off-by: Ramesh Nachimuthu <rnachimu@redhat.com> Reviewed-on: http://review.gluster.org/7755 Reviewed-by: Sahina Bose <sabose@redhat.com>
* autoconfig: fix for timeout param issue in auto modeRamesh Nachimuthu2014-05-201-1/+1
| | | | | | | | | | | Passing the timeout parameter to writeDelta() in auto mode also. Change-Id: Ia18ae225e5b6e239b4d4b09357d51844d5a868f5 Signed-off-by: Ramesh Nachimuthu <rnachimu@redhat.com> Reviewed-on: http://review.gluster.org/7817 Reviewed-by: Shubhendu Tripathi <shtripat@redhat.com> Reviewed-by: Kanagaraj M <kmayilsa@redhat.com> Reviewed-by: Timothy Asir <tim.gluster@gmail.com>
* plugins: refactored volume status pluginKanagaraj M2014-05-201-28/+29
| | | | | | | | | | Passing the required arguments to the functions instead of passing 'arg'. Change-Id: Ib6cb654bc448c871a24dc636e6829ccf7523c2c8 Signed-off-by: Kanagaraj M <kmayilsa@redhat.com> Reviewed-on: http://review.gluster.org/7756 Reviewed-by: Sahina Bose <sabose@redhat.com>
* nagios-server-addons: NRPE command with timeoutShubhendu Tripathi2014-05-203-66/+100
| | | | | | | | | | | | | | | | | | | | | | | Introduced a utility method to return a NRPE base command with timeout set externally. Currently if a plugin internally executes NRPE to get the details from the node, there is no mechanism that timeout could be set to more than 10 sec. This method provides the NRPE command with timeout (if passed). This is required for some of the NRPE calls where gluster commands get executed and they might take more time. All the plugins, which execute a NRPE within, can provide optional command line argument for timeout, and same can be used for forming the proper NRPE call with timeout value. Change-Id: Id97624df743664a320a585acc4a85cfcf64d0a07 Signed-off-by: Shubhendu Tripathi <shtripat@redhat.com> Reviewed-on: http://review.gluster.org/7682 Reviewed-by: Sahina Bose <sabose@redhat.com> Tested-by: Sahina Bose <sabose@redhat.com>
* nagios-server-addons : Fixed cluster status plugin to correct queriesNishanth Thomas2014-05-131-10/+7
| | | | | | | | | | | | | | | BZ 1089641, changed the mk-livestatus query so that it picks the right services BZ 1089670, introduced a check to set status to OK if no volumes present in the cluster Bug-Url:https://bugzilla.redhat.com/show_bug.cgi?id=1089670 Bug:Url:https://bugzilla.redhat.com/show_bug.cgi?id=1089641 Change-Id: I93aed685c75ff3455e1129c0d8f9e485b6c5f2b0 Signed-off-by: Nishanth Thomas <nthomas@redhat.com> Reviewed-on: http://review.gluster.org/7708 Reviewed-by: Sahina Bose <sabose@redhat.com> Tested-by: Sahina Bose <sabose@redhat.com>
* autoconf: don't remove disconnected hosts from nagiosRamesh Nachimuthu2014-05-122-33/+50
| | | | | | | | | | | | | | | | | | | | | | | | | Currently auto-config removes all the hosts with disconnected status in 'peer status' from nagios configuration. But that is not correct as a host can goto disconnected status for many differnt reasons and we should not remove the configuration. Similarly disconnected hosts need not be added to nagios configuration during auto discovery. Fixing this by ingnoring all the disconnected hosts in delta calculation. Note: Host name can't be used to identify disconnected hosts as we may not be able to communicate the disconnected host to get the host name. Hence Host UUID in peer status is used to intentify the host. Bug-Url: https://bugzilla.redhat.com/1091170 Change-Id: I3fe730d2545f8cbc8224c576712b09bb4c16e712 Signed-off-by: Ramesh Nachimuthu <rnachimu@redhat.com> Reviewed-on: http://review.gluster.org/7677 Reviewed-by: Sahina Bose <sabose@redhat.com>
* autoconfig: Corrected comparison for volume typeSahina Bose2014-05-081-1/+1
| | | | | | | | | | | The self heal service is only added for a replicate volume type. Corrected the comparison to do so. Change-Id: I78c783bbe9b2f17dc1d5cfa7d86d9a69690be3cd Signed-off-by: Sahina Bose <sabose@redhat.com> Reviewed-on: http://review.gluster.org/7711 Reviewed-by: Ramesh N <rnachimu@redhat.com> Tested-by: Ramesh N <rnachimu@redhat.com>
* server-addons: add active checks for gluster processRamesh Nachimuthu2014-05-071-1/+1
| | | | | | | | | | | Adding checks for gluster process and bricks. Change-Id: Ie688233db2a5da0d1a26d5576507559e7e8f5d2d Signed-off-by: Ramesh Nachimuthu <rnachimu@redhat.com> Reviewed-on: http://review.gluster.org/7697 Reviewed-by: Kanagaraj M <kmayilsa@redhat.com> Reviewed-by: Shubhendu Tripathi <shtripat@redhat.com> Reviewed-by: Sahina Bose <sabose@redhat.com>
* nagios-server-addons: Disabled host event handlerShubhendu Tripathi2014-05-061-2/+3
| | | | | | | | | | | Disabled the event handler for gluster hosts Change-Id: Ib81245add29126761185dec110015ecb73449a3a Signed-off-by: Shubhendu Tripathi <shtripat@redhat.com> Reviewed-on: http://review.gluster.org/7664 Reviewed-by: Kanagaraj M <kmayilsa@redhat.com> Reviewed-by: Ramesh N <rnachimu@redhat.com> Reviewed-by: Timothy Asir <tim.gluster@gmail.com>
* nagios-server: Fixed livestatus query for quotaSahina Bose2014-05-051-1/+1
| | | | | | | | | | As the service name has changed, fixed the livestatus query for Quota service Change-Id: I4967429f357786aad5f6402293054a930371a52e Signed-off-by: Sahina Bose <sabose@redhat.com> Reviewed-on: http://review.gluster.org/7673 Reviewed-by: Kanagaraj M <kmayilsa@redhat.com>
* refactored service description for gluster servicesKanagaraj M2014-05-052-2/+2
| | | | | | | | | | | | | | service descriptions: - Gluster Management - NFS - Self-Heal - Quota - Brick - <brick_dir> Change-Id: I3ece94ef61fcb3fd264e848508362375842dcc6f Signed-off-by: Kanagaraj M <kmayilsa@redhat.com> Reviewed-on: http://review.gluster.org/7671 Reviewed-by: Sahina Bose <sabose@redhat.com>
* nagios-server: Added quorum service to auto configSahina Bose2014-05-051-1/+10
| | | | | | | | | | | | Added a quorum service to auto-config generator. Renamed "Volume Status Quota" to "Volume Quota" service Change-Id: Id83eefeee63bf7c34373196f4cfa2b326e1cb71d Signed-off-by: Sahina Bose <sabose@redhat.com> Reviewed-on: http://review.gluster.org/7661 Reviewed-by: Kanagaraj M <kmayilsa@redhat.com> Reviewed-by: Ramesh N <rnachimu@redhat.com>
* autoconf: handle exceptions in nrpe commandsRamesh Nachimuthu2014-05-041-18/+25
| | | | | | | | | Handling the exceptions and failure from NRPE commands. Change-Id: I36cba2d6adf8484ba134ad3f7aec77437ba07857 Signed-off-by: Ramesh Nachimuthu <rnachimu@redhat.com> Reviewed-on: http://review.gluster.org/7659 Reviewed-by: Sahina Bose <sabose@redhat.com>
* autoconf: send email notification for auto syncRamesh Nachimuthu2014-05-021-11/+33
| | | | | | | | | | | | | | | | Sending a cusotm email notification whenever nagios configurations are changed by auto-config service. note: By default, nagios mail notification command 'notify-service-by-email' doesn't include the comments macro. But in auto config, all the changes are sent as comment to the custom notificiation command. Hence nagios command 'notify-service-by-email' should be changed to include the macro '$NOTIFICATIONCOMMENT$' as part of mail template. Change-Id: Ie5dd23578e08fbc757c46a884a79923ff3c403b2 Signed-off-by: Ramesh Nachimuthu <rnachimu@redhat.com> Reviewed-on: http://review.gluster.org/7626 Reviewed-by: Sahina Bose <sabose@redhat.com>
* vol-status: change volume status based on brick statusKanagaraj M2014-05-022-29/+39
| | | | | | | | | | | | | Currently the brick description will have only brick path. Changed the code to query based on volume name,cluster name. Volume status will be decided based on state of the bricks in nagios. Change-Id: If1e4f85f5643cf2b747ae8837b905f0d5b8d9b22 Signed-off-by: Kanagaraj M <kmayilsa@redhat.com> Reviewed-on: http://review.gluster.org/7611 Reviewed-by: Ramesh N <rnachimu@redhat.com> Reviewed-by: Sahina Bose <sabose@redhat.com>
* autoconf: discover volume list and info separatelyRamesh Nachimuthu2014-05-021-2/+16
| | | | | | | | | | | | | NRPE doesn't support transfering large junk of data as a result. Hence we have to discover the volume details one by one. First fetch the volume list using 'discover_volume_list' NRPE command then fetch the brick details of volume one by one using 'discover_volume_info' NRPE command Change-Id: I2cd3c2309ffa2f1bf5c271e06dd03ade75bf5d92 Signed-off-by: Ramesh Nachimuthu <rnachimu@redhat.com> Reviewed-on: http://review.gluster.org/7631 Reviewed-by: Sahina Bose <sabose@redhat.com>
* plugins: Server side plugin for geo-repSahina Bose2014-05-012-1/+24
| | | | | | | | | Added server side plugin option for geo-rep status Change-Id: I6f604862ec7d4813298a670b7ed2cc0b448faf23 Signed-off-by: Sahina Bose <sabose@redhat.com> Reviewed-on: http://review.gluster.org/7592
* plugins: Changed the server side vol_status commandsSahina Bose2014-05-011-3/+3
| | | | | | | | | | | Changed the nrpe vol_status commands to pass arguments for the different status - self-heal, quota etc Change-Id: Ib26d445a582139fd9d5a07d2b9646a3517843548 Signed-off-by: Sahina Bose <sabose@redhat.com> Reviewed-on: http://review.gluster.org/7638 Reviewed-by: Ramesh N <rnachimu@redhat.com> Reviewed-by: Kanagaraj M <kmayilsa@redhat.com>