Update MariaDBMonitor documentation

Mention the secondary use of switchover_timeout. Formatting fixes.
2018-11-07 13:20:02 +02:00
parent 07d497a672
commit 56d28d703c
1 changed files with 33 additions and 34 deletions
--- a/Documentation/Monitors/MariaDB-Monitor.md
+++ b/Documentation/Monitors/MariaDB-Monitor.md
@ -311,16 +311,13 @@ master.
   2. Disable the *read\_only*-flag.
   3. Enable scheduled server events (if event handling is on).
   4. Run the commands in `promotion_sql_file`.
- 5. Start replication from external master is one existed.
+   5. Start replication from external master if one existed.
 4. Redirect all other slaves to replicate from the new master:
   1. STOP SLAVE and RESET SLAVE
   2. CHANGE MASTER TO
   3. START SLAVE
 5. Check that all slaves are replicating.
 Failover may lose events if no slave managed to replicate the events before the
 master went down.
 **Switchover** swaps a running master with a running slave. It does the
 following:
@ -492,6 +489,11 @@ The backends must all use GTID-based replication, and the domain id should not
 change during a switchover or failover. Master and slaves must have
 well-behaving GTIDs with no extra events on slave servers.
 Failover cannot be performed if MaxScale was started only after the master
 server went down. This is because MaxScale needs reliable information on the
 gtid domain of the cluster and the replication topology in general to properly
 select the new master.
 Failover may lose events. If a master goes down before sending new events to at
 least one slave, those events are lost when a new master is chosen. If the old
 master comes back online, the other servers have likely moved on with a
@ -614,18 +616,15 @@ encrypted with the same key to avoid erroneous decryption.
 #### `failover_timeout` and `switchover_timeout`
-Time limit for the cluster failover and switchover in seconds. The default values
+Time limit for failover and switchover operations, in seconds. The default
-are 90 seconds.
+values are 90 seconds for both. `switchover_timeout` is also used as the time
 limit for a rejoin operation. Rejoin should rarely time out, since it is a
 faster operation than switchover.
 If no successful failover/switchover takes place within the configured time
 period, a message is logged and automatic failover is disabled. This prevents
 further automatic modifications to the misbehaving cluster.
 `failover_timeout` also controls how long a MaxScale instance that has
 transitioned from passive to active will wait for a failover to take place after
 an apparent loss of a master server. If no new master server is detected within
 the configured time period, failover will be initiated again.
 #### `verify_master_failure` and `master_failure_timeout`
 Enable additional master failure verification for automatic failover.