MaxScale

Author	SHA1	Message	Date
Markus Mäkelä	dcf9d7f152	Fix calls to diagnostics_json Add missing listener JSON diagnostics call. Check that the diagnostics_json function exists before calling it. As the protocol modules don't have diagnostics functions, they aren't called. Replace hard-coded strings with constant parameters. This makes it slightly cleaner.	2018-03-20 13:07:27 +02:00
Johan Wikman	236e906d88	Revert "Turn MariaDB Monitor struct to class with public fields" This reverts commit cb6f70119d9857b277306e9af5881fe29c574a32.	2018-02-24 15:37:50 +02:00
Esa Korhonen	cb6f70119d	Turn MariaDB Monitor struct to class with public fields Allows using std::string for strings. Also, cleanup.	2018-02-21 11:00:42 +02:00
Esa Korhonen	b8d3da4968	Add error tolerance to "servers_no_promotion" Previously, if the list contained servers that were not monitored by the monitor yet were valid servers, an error value would be returned and the monitor failed to start. With this update, the non-monitored servers are simply ignored when forming the final list. Also, added printing of the list to diagnostics.	2018-02-12 10:49:28 +02:00
Esa Korhonen	faaf43ff39	Add gtid to monitor diagnostics, clean up formatting Gtid:s are now queried every monitor loop. dignostics() no longer prints slave related info if the server has no slave connection.	2018-02-10 12:32:56 +02:00
Esa Korhonen	fa8f6a5da3	Fix monitor error with empty servers_no_promotion	2018-02-07 16:11:47 +02:00
Esa Korhonen	1cf3de4a74	Add config parameter for excluding servers from failover "servers_no_promotion" is a comma-separated list of servers which cannot be chosen when selecting a new master during failover (auto or manual), or when automatically selecting a new master for switchover (currently disabled). The servers in the list are redirected normally and can be promoted by switchover when manually selecting a new master.	2018-02-07 14:07:10 +02:00
Esa Korhonen	255250652d	Refactor pre-switchover, add similar checks as in failover Now detects some erroneous situations before starting switchover. Switchover can be activated without specifying current master. In this case, the cluster master server is selected.	2018-01-31 10:40:09 +02:00
Markus Mäkelä	396b81f336	Fix in-source builds The internal header directory conflicted with in-source builds causing a build failure. This is fixed by renaming the internal header directory to something other than maxscale. The renaming pointed out a few problems in a couple of source files that appeared to include internal headers when the headers were in fact public headers. Fixed maxctrl in-source builds by making the copying of the sources optional.	2017-11-22 18:40:18 +02:00
Markus Mäkelä	07e58444f6	Improve error message for zero monitor timeout values The error message was not 100% accurate about the value. In addition to that, neither the value itself nor the monitor or parameter names were printed in the error message.	2017-11-17 18:05:03 +02:00
Markus Mäkelä	703230a930	Only write monitor journal when it changes The state of the monitored servers is only persisted if the states of the servers have changed. This removes the unnecessary disk IO caused by the writing on the monitor journal.	2017-11-16 15:38:13 +02:00
Markus Mäkelä	3a78b716b8	Merge branch '2.2' into 2.2-mrm	2017-10-30 11:06:34 +02:00
Markus Mäkelä	551bb81929	Loosen the atomicity requirement for the passive parameter As the passive parameter is only used by the failover and the failover can only be initiated by the monitor, there is no true need to synchronize the reads and write of this parameter. As all runtime changes are protected by the runtime lock, only partial reads are of concern. For the supported platforms, this is not a practical problem and it only confuses the reader when other variables are modified without atomic operations.	2017-10-27 15:31:46 +03:00
Markus Mäkelä	600509be4a	Fix master failure tracking The master failure was assumed to be the only master related event for each monitoring loop. If the master was switched by an external actor, the monitor tracking would be out of sync.	2017-10-27 15:31:46 +03:00
Markus Mäkelä	2d1e5f46fa	Remove use of timestamps in failover code Using timestamps to detect whether MaxScale was active or passive can cause problems if multiple events happen at the same time. This can be avoided by separating events into actively observed and passively observed events. This clarifies the logic by removing the ambiguity of timestamps. As the monitoring threads are separate from the worker threads, it is prudent to use atomic operations to modify and read the state of the MaxScale. This will impose an happens-before relation between MaxScale being set into passive mode and events being classified as being passively observed.	2017-10-27 15:31:46 +03:00
Esa Korhonen	63c7550196	MXS-1490 Prepare for failover functionality addition Moved mon_process_failover() from monitor.cc to mysql_mon.cc. Renamed some functions and variables related to previous failover functionality to avoid confusion.	2017-10-25 12:24:29 +03:00
Markus Mäkelä	582a65f77c	Do not return empty relationships If no relationships of a particular type are defined for a resource, the key for that relationship should not be defined.	2017-10-23 19:37:24 +03:00
Johan Wikman	df816ea2a9	MXS-1460 Add failover_script parameter The failover script can now be specified in the configuration file.	2017-10-03 15:24:29 +03:00
Markus Mäkelä	8c3c103060	Merge branch '2.2' into 2.2-mrm	2017-10-03 14:52:21 +03:00
Markus Mäkelä	7ca8db14de	MXS-1444: Add monitor parameter alteration The parameter handling for monitors can now be done in a consistent manner by establishing a rule that the monitor owns the parameter object as long as it is running. This will allow parameters to be added and removed safely both from outside and inside monitors. Currently this functionality is only used by mysqlmon to disable failover after an attempt to perform a failover has failed.	2017-10-03 14:50:20 +03:00
Markus Mäkelä	27d1be7f96	Merge branch '2.2' of github.com:mariadb-corporation/MaxScale into 2.2	2017-10-03 14:46:14 +03:00
Markus Mäkelä	bd39284f9c	Merge branch '2.1' into 2.2	2017-10-03 14:30:06 +03:00
Johan Wikman	cf0a87e7f2	MXS-1441 Expose monitor_launch_command So that a specific monitor may create the command and replace monitor specific script variables before giving the command for execution.	2017-10-03 10:32:42 +03:00
Johan Wikman	e295d438d4	MXS-1441 Expose monitor_launch_script So that it can be called directly from a monitor.	2017-10-03 09:19:23 +03:00
Johan Wikman	438b4e0341	Merge branch '2.2' into 2.2-mrm	2017-10-02 15:49:08 +03:00
Johan Wikman	68432bbaa3	Rename MXS_MONITOR::databases to MXS_MONITOR::monitored_servers More descriptive name. Some local varaibles could now also be renamed to be more descriptive, but that's for another day.	2017-10-02 15:33:58 +03:00
Johan Wikman	8d03876e3e	Rename MXS_MONITOR_SERVERS to MXS_MONITORED_SERVER An element in a linked list is not a list.	2017-10-02 15:05:17 +03:00
Markus Mäkelä	d4fd34cecd	MXS-1446: Move failover parameters into mysqlmon The `failover` and `failover_timeout` parameters are now declared as a part of the mysqlmon module. Changed the implementation of the failover function so that the dependencies on the monitor struct can be removed or moved into parameters.	2017-09-28 08:23:34 +03:00
Markus Mäkelä	ef115208e6	MXS-1446: Move failover to mysqlmon Split the state change processing and failover handling into two separate functions and added a call to the failover function into mysqlmon. This prevents unintended behavior when failover is enabled for non-mysqlmon monitors. The parameter itself still needs to be moved into mysqlmon. Moved the failover documentation to the mysqlmon documentation as it is specific to this monitor.	2017-09-28 07:54:42 +03:00
Markus Mäkelä	0d6c06f33d	MXS-1446: Add heartbeat conversion macros The macros make the conversion from heartbeats to seconds more convenient and consistent.	2017-09-27 19:44:25 +03:00
Markus Mäkelä	667440fbef	MXS-1446: Calculate the monitor event only once As the monitor event is now stored in the server, it can be re-used when the event is converted to string form. This also fixes the problem of state calculation taking place when the event happened in the past.	2017-09-27 19:44:25 +03:00
Markus Mäkelä	4c3d6f6884	MXS-1446: Add execution of dummy failover command The failover command is simulated by executing a call to /usr/bin/echo with all possible monitor parameters. This allows testing of the failover mechanism without actually using the failover command.	2017-09-27 19:44:21 +03:00
Markus Mäkelä	316f792242	MXS-1446: Make `failover_timeout` configurable The time that MaxScale waits for a failover is now configurable.	2017-09-27 19:37:41 +03:00
Markus Mäkelä	ef2ee38ccf	MXS-1446: Store more detailed event information The timestamp of the last change from passive to active is now tracked. This, with the timestamps of the last master_down and master_up events, allows detection of cases when MaxScale was failed over but the failover was not done. Currently, only a warning is logged if no new master has appeared within 90 seconds of a master_down event and MaxScale was set to active from passive. The last event and when the event was triggered is now shown for all servers. The latest change from passive to active is also shown.	2017-09-27 19:32:58 +03:00
Markus Mäkelä	3e1d89ff17	MXS-1446: Store last triggered event for each server When an event occurs on a server, it is now stored so that the last event for each server is known. This allows a state change to trigger an event even if, at the time of the event, no action was taken. This change is only cosmetic as no functionality is implemented.	2017-09-27 19:32:58 +03:00
Markus Mäkelä	ab2286235f	Merge branch '2.2' into 2.2-mrm	2017-09-27 19:32:39 +03:00
Markus Mäkelä	f20005dddc	Add missing parameters to `alter monitor` The `script_timeout` and `journal_max_age` parameters weren't handled in the monitor alteration code. Also added missing documentation to maxadmin help output for `alter monitor`.	2017-09-27 19:26:05 +03:00
Johan Wikman	56b947b27d	MXS-1445: Provide credentials to scripts If an invoked script must access servers, it needs credentials. When invoked, a script can now be provided with the monitor credentials of MaxScale using the variable CREDENTIALS. It will be expanded like user:password@[...]:N1,user:password@[...]:N2 for every server the monitor in question is monitoring. That is, irrespective of whether it is a master or a slave, running or not. Thus, a failover script could be specified like: [MyMonitor] type=monitor module=mysqlmon ... script=.../failover.sh --credentials=$CREDENTIALS --slaves=$SLAVELIST events=master_down Note, it may make sense to introduce specific failover (and switchover) keywords, but with the above addition it is possible to start experimenting with failover scripts.	2017-09-22 13:54:14 +03:00
Markus Mäkelä	ab777c76c7	Add CHILDREN to monitor scripts The CHILDREN parameter expands to a list of server IPs and ports that are direct descendants of the server that initiated the event. Also added a note that the variables can expand to empty strings if nothing matches the criteria of the variable.	2017-09-18 13:12:17 +03:00
Markus Mäkelä	9080072de5	Add PARENT variable to monitor scripts The scripts now replace the PARENT variable with the IP and port of the server that is the direct parent node of the server that initiated the event. For master-slave clusters, this will be the master IP if the server that triggered the event is a slave.	2017-09-18 13:00:02 +03:00
Markus Mäkelä	130b686d9b	MXS-1405: Log subprocess output immediately When the subprocess outputs a line, the message should be logged immediately. This allows automated timestamps for the output of the executed subprocess.	2017-09-18 11:39:33 +03:00
Markus Mäkelä	7e6e8d3e29	MXS-1405: Capture subprocess output The output by the subprocesses launched by the externcmd system is now captured and logged.	2017-09-18 11:39:33 +03:00
Markus Mäkelä	fe40511d97	MXS-1405: Take script_timeout into use The script_timeout parameter is now used by all monitors.	2017-09-14 12:34:34 +03:00
Markus Mäkelä	9d3772a67e	MXS-1262: Minor improvements to monitor journals Moved 4 byte get/set into utils header. The byte packing functions in maxscale/protocol/mysql.h should be migrated to the utils directory where they can also be used by non-mysql code. The temporary files are now generated with mkstemp. This will prevent conflicts with multiple monitors operating on the same temporary journal even though it is impossible in practice. Added missing error messages to a couple of the functions.	2017-08-11 11:35:13 +03:00
Markus Mäkelä	53bf21f785	MXS-1262: Use monitor journals in all monitors All monitors now persist the state of the server in a monitor journal file. Moved the removal of stale journals into the core and removed them from the monitor journal interface.	2017-08-11 04:09:08 +03:00
Markus Mäkelä	b448b129d0	MXS-1262: Move journal_max_age to MaxScale core The parameter is now defined in the monitor. Further refactoring is needed to make the interface of the journal system simpler.	2017-08-11 04:09:08 +03:00
Markus Mäkelä	837d57f4f4	MXS-1262: Move monitor journals into the core The journaling functionality is now in the core. Only the MySQL Monitor is using it.	2017-08-11 04:09:07 +03:00
Markus Mäkelä	05d185fc02	Fix monitor repurposing The monitor active state is now modified under the same lock. This should make creation and destruction of monitors deterministic.	2017-08-09 11:39:24 +03:00
Markus Mäkelä	e133e758a6	MXS-1300: Fix deletion of monitors The monitors should only be reused if they have the same name and they use the same module. This way the only difference is in configuration. Fixed MaxCtrl detection of bad options and altered monitor creation test to expect correct results. Also improved some of the error messages.	2017-08-09 11:39:24 +03:00
Markus Mäkelä	512c3c018d	Add recycling of destroyed monitors If a destroyed monitor is created again, it will be reused. This should prevent excessive memory growth when the same monitor is created and destroyed again.	2017-08-09 11:39:24 +03:00

1 2

75 Commits