 11960a1e93
			
		
	
	11960a1e93
	
	
	
		
			
			The documentation didn't explain how to estimate how much memory a transaction takes and whether it was a global limit or a per session limit.
		
			
				
	
	
		
			646 lines
		
	
	
		
			27 KiB
		
	
	
	
		
			Markdown
		
	
	
	
	
	
			
		
		
	
	
			646 lines
		
	
	
		
			27 KiB
		
	
	
	
		
			Markdown
		
	
	
	
	
	
| # Readwritesplit
 | |
| 
 | |
| This document provides a short overview of the **readwritesplit** router module
 | |
| and its intended use case scenarios. It also displays all router configuration
 | |
| parameters with their descriptions. A list of current limitations of the module
 | |
| is included and use examples are provided.
 | |
| 
 | |
| [TOC]
 | |
| 
 | |
| ## Overview
 | |
| 
 | |
| The **readwritesplit** router is designed to increase the read-only processing
 | |
| capability of a cluster while maintaining consistency. This is achieved by
 | |
| splitting the query load into read and write queries. Read queries, which do not
 | |
| modify data, are spread across multiple nodes while all write queries will be
 | |
| sent to a single node.
 | |
| 
 | |
| The router is designed to be used with a traditional Master-Slave replication
 | |
| cluster. It automatically detects changes in the master server and will use the
 | |
| current master server of the cluster. With a Galera cluster, one can achieve a
 | |
| resilient setup and easy master failover by using one of the Galera nodes as a
 | |
| Write-Master node, where all write queries are routed, and spreading the read
 | |
| load over all the nodes.
 | |
| 
 | |
| ## Configuration
 | |
| 
 | |
| Readwritesplit router-specific settings are specified in the configuration file
 | |
| of MariaDB MaxScale in its specific section. The section can be freely named but
 | |
| the name is used later as a reference in a listener section.
 | |
| 
 | |
| For more details about the standard service parameters, refer to the
 | |
| [Configuration Guide](../Getting-Started/Configuration-Guide.md).
 | |
| 
 | |
| Starting with 2.3, all router parameters can be configured at runtime. Use
 | |
| `maxctrl alter service` to modify them. The changed configuration will only be
 | |
| taken into use by new sessions.
 | |
| 
 | |
| ## Parameters
 | |
| 
 | |
| ### `max_slave_connections`
 | |
| 
 | |
| **`max_slave_connections`** sets the maximum number of slaves a router session
 | |
| uses at any moment. The default is to use at most 255 slave connections per
 | |
| client connection. In older versions the default was to use all available slaves
 | |
| with no limit.
 | |
| 
 | |
| ```
 | |
| max_slave_connections=<max. number, or % of available slaves>
 | |
| ```
 | |
| 
 | |
| For example, if you have configured MaxScale with one master and three slaves
 | |
| and set `max_slave_connections=2`, for each client connection a connection to
 | |
| the master and two slave connections would be opened. The read query load
 | |
| balancing is then done between these two slaves and writes are sent to the
 | |
| master.
 | |
| 
 | |
| By tuning this parameter, you can control how dynamic the load balancing is at
 | |
| the cost of extra created connections. With a lower value of
 | |
| `max_slave_connections`, less connections per session are created and the set of
 | |
| possible slave servers is smaller. With a higher value in
 | |
| `max_slave_connections`, more connections are created which requires more
 | |
| resources but load balancing will almost always give the best single query
 | |
| response time and performance. Longer sessions are less affected by a high
 | |
| `max_slave_connections` as the relative cost of opening a connection is lower.
 | |
| 
 | |
| ### `max_slave_replication_lag`
 | |
| 
 | |
| **`max_slave_replication_lag`** specifies how many seconds a slave is allowed to
 | |
| be behind the master. If the lag is bigger than the configured value a slave
 | |
| can't be used for routing.
 | |
| 
 | |
| This feature is disabled by default.
 | |
| 
 | |
| 	max_slave_replication_lag=<allowed lag in seconds>
 | |
| 
 | |
| The Readwritesplit-router does not detect the replication lag itself. A monitor
 | |
| such as the MariaDB-monitor for a Master/Slave-cluster is required. This option
 | |
| only affects Master-Slave clusters. Galera clusters do not have a concept of
 | |
| slave lag even if the application of write sets might have lag. When a server is
 | |
| disqualified from routing because of replication lag, a warning is logged. Similarly,
 | |
| when the server has caught up enough to be a valid routing target, another warning
 | |
| is logged. These messages are only logged when a query is being routed and the
 | |
| replication state changes.
 | |
| 
 | |
| ### `use_sql_variables_in`
 | |
| 
 | |
| **`use_sql_variables_in`** specifies where should queries, which read session
 | |
| variable, be routed. The syntax for `use_sql_variable_in` is:
 | |
| 
 | |
|     use_sql_variables_in=[master|all]
 | |
| 
 | |
| The default is to use SQL variables in all servers.
 | |
| 
 | |
| When value `all` is used, queries reading session variables can be routed to any
 | |
| available slave (depending on selection criteria). Queries modifying session
 | |
| variables are routed to all backend servers by default, excluding write queries
 | |
| with embedded session variable modifications, such as:
 | |
| 
 | |
|     INSERT INTO test.t1 VALUES (@myid:=@myid+1)
 | |
| 
 | |
| In above-mentioned case the user-defined variable would only be updated in the
 | |
| master where the query would be routed to due to the `INSERT` statement.
 | |
| 
 | |
| ```
 | |
| [Splitter-Service]
 | |
| type=service
 | |
| router=readwritesplit
 | |
| servers=dbserv1, dbserv2, dbserv3
 | |
| user=maxscale
 | |
| password=96F99AA1315BDC3604B006F427DD9484
 | |
| disable_sescmd_history=true
 | |
| master_failure_mode=fail_on_write
 | |
| ```
 | |
| 
 | |
| ### `connection_keepalive`
 | |
| 
 | |
| Send keepalive pings to backend servers. This feature was introduced in MaxScale
 | |
| 2.2.0. The default value is 300 seconds starting with 2.3.2 and for older
 | |
| versions the feature was disabled by default.
 | |
| 
 | |
| The parameter value is the interval in seconds between each keepalive ping. A
 | |
| keepalive ping will be sent to a backend server if the connection is idle and it
 | |
| has not been used within `n` seconds where `n` is greater than or equal to the
 | |
| value of _connection_keepalive_. The keepalive pings are only sent when the
 | |
| client executes a query.
 | |
| 
 | |
| This functionality allows the readwritesplit module to keep all backend
 | |
| connections alive even if they are not used. This is a common problem if the
 | |
| backend servers have a low _wait_timeout_ value and the client connections live
 | |
| for a long time or if your workload is extremely read-heavy with writes done at
 | |
| lower intervals than the configured _wait_timeout_.
 | |
| 
 | |
| ### `master_reconnection`
 | |
| 
 | |
| Allow the master server to change mid-session. This feature was introduced in
 | |
| MaxScale 2.3.0 and is disabled by default.
 | |
| 
 | |
| When a readwritesplit session starts, it will pick a master server as the
 | |
| current master server of that session. By default, when this master server
 | |
| changes mid-session, the connection will be closed.
 | |
| 
 | |
| If the `master_reconnection` parameter is enabled, the master server is allowed
 | |
| to change as long as the session meets the following criteria:
 | |
| 
 | |
| * The session is already connected to the slave that was chosen to be the new master
 | |
| * No transaction is open
 | |
| * Autocommit is enabled
 | |
| * No `LOAD DATA LOCAL INFILE` is in progress
 | |
| * There are no queries being actively routed to the old master
 | |
| 
 | |
| When `master_reconnection` is enabled in conjunction with either
 | |
| `master_failure_mode=fail_on_write` or `master_failure_mode=error_on_write`, the
 | |
| session can recover from the loss of a master server. This means that when a
 | |
| session starts without a master server and later a slave server that it is
 | |
| connected to is promoted as the master, the session will come out of the
 | |
| read-only mode (described in detail in the
 | |
| [`master_failure_mode`](#master_failure_mode) documentation).
 | |
| 
 | |
| ### `slave_selection_criteria`
 | |
| 
 | |
| This option controls how the readwritesplit router chooses the slaves it
 | |
| connects to and how the load balancing is done. The default behavior is to route
 | |
| read queries to the slave server with the lowest amount of ongoing queries i.e.
 | |
| `LEAST_CURRENT_OPERATIONS`.
 | |
| 
 | |
| The option syntax:
 | |
| 
 | |
| ```
 | |
| slave_selection_criteria=<criteria>
 | |
| ```
 | |
| 
 | |
| Where `<criteria>` is one of the following values.
 | |
| 
 | |
| * `LEAST_GLOBAL_CONNECTIONS`, the slave with least connections from MariaDB MaxScale
 | |
| * `LEAST_ROUTER_CONNECTIONS`, the slave with least connections from this service
 | |
| * `LEAST_BEHIND_MASTER`, the slave with smallest replication lag
 | |
| * `LEAST_CURRENT_OPERATIONS` (default), the slave with least active operations
 | |
| * `ADAPTIVE_ROUTING`, based on server average response times. See below.
 | |
| 
 | |
| The `LEAST_GLOBAL_CONNECTIONS` and `LEAST_ROUTER_CONNECTIONS` use the
 | |
| connections from MariaDB MaxScale to the server, not the amount of connections
 | |
| reported by the server itself.
 | |
| 
 | |
| `LEAST_BEHIND_MASTER` and `ADAPTIVE_ROUTING` do not take server weights into account
 | |
| when choosing a server.
 | |
| 
 | |
| `ADAPTIVE_ROUTING` Measures average server response times. The server averages
 | |
| are used as proxies of server load conditions. At selection time the averages
 | |
| are copied and modified to favor faster servers, while at the same time
 | |
| guaranteeing at lest some traffic to the slowest servers. The server selection
 | |
| is probabilistic based on roulette wheel selection.
 | |
| 
 | |
| #### Server Weights and `slave_selection_criteria`
 | |
| 
 | |
| NOTE: Server Weights have been deprecated in MaxScale 2.3 and will be removed
 | |
| at a later time.
 | |
| 
 | |
| The following formula is used to calculate a score for a server when the
 | |
| `weightby` parameter is defined.
 | |
| 
 | |
| ```
 | |
| score = x / w
 | |
| ```
 | |
| 
 | |
| `x` is the absolute value of the chosen metric (queries, connections) and
 | |
| `w` is the weight of the server. The value of `w` is the relative weight
 | |
| of the server in relation to all the servers configured for the
 | |
| service. The server with the highest score that fulfills all other
 | |
| criteria is chosen as the target server.
 | |
| 
 | |
| Read the [configuration guide](../Getting-Started/Configuration-Guide.md#weightby)
 | |
| for a more detailed example on how the weights are calculated.
 | |
| 
 | |
| For `LEAST_CURRENT_OPERATIONS`, the metric is number of active queries on
 | |
| the candidate server, for `LEAST_GLOBAL_CONNECTIONS` and
 | |
| `LEAST_ROUTER_CONNECTIONS` it is the number of open connections and for
 | |
| `LEAST_BEHIND_MASTER` it is the number of seconds a server is behind the
 | |
| master.
 | |
| 
 | |
| #### Interaction Between `slave_selection_criteria` and `max_slave_connections`
 | |
| 
 | |
| Depending on the value of `max_slave_connections`, the slave selection criteria
 | |
| behave in different ways. Here are a few example cases of how the different
 | |
| criteria work with different amounts of slave connections.
 | |
| 
 | |
| * With `slave_selection_criteria=LEAST_GLOBAL_CONNECTIONS` and
 | |
| `max_slave_connections=1`, each session picks one slave and one master
 | |
| 
 | |
| * With `slave_selection_criteria=LEAST_CURRENT_OPERATIONS` and
 | |
| `max_slave_connections=100%`, each session picks one master and as many slaves
 | |
| as possible
 | |
| 
 | |
| * With `slave_selection_criteria=LEAST_CURRENT_OPERATIONS` each read is load
 | |
| balanced based on how many queries are active on a particular slave
 | |
| 
 | |
| * With `slave_selection_criteria=LEAST_GLOBAL_CONNECTIONS` each read is sent to
 | |
| the slave with the least amount of connections
 | |
| 
 | |
| ### `max_sescmd_history`
 | |
| 
 | |
| **`max_sescmd_history`** sets a limit on how many distinct session commands each
 | |
| session can execute before the session command history is disabled. The default
 | |
| is 50 session commands starting with MaxScale 2.3.0. In older versions, the
 | |
| session command history was disabled by default.
 | |
| 
 | |
| ```
 | |
| # Set a limit on the session command history
 | |
| max_sescmd_history=1500
 | |
| ```
 | |
| 
 | |
| When a session command is executed for the first time, it is stored in
 | |
| memory. Any subsequent executions of the same command are stored as references
 | |
| to the original command. By storing references instead of copies of the data,
 | |
| the amount of memory used is reduced.
 | |
| 
 | |
| If you have long-running sessions which change the session state often, increase
 | |
| the value of this parameter if server reconnections fail due to disabled session
 | |
| command history.
 | |
| 
 | |
| When a limitation is set, it effectively creates a cap on the session's memory
 | |
| consumption. This might be useful if connection pooling is used and the sessions
 | |
| use large amounts of session commands.
 | |
| 
 | |
| ### `disable_sescmd_history`
 | |
| 
 | |
| This option disables the session command history. This way no history is stored
 | |
| and if a slave server fails, the router will not try to replace the failed
 | |
| slave. Disabling session command history will allow long-lived connections
 | |
| without causing a constant growth in the memory consumption.
 | |
| 
 | |
| This option is only intended to be enabled if the value of
 | |
| `max_slave_connections` is lowered below the default value. This will allow a
 | |
| failed slave to be replaced with a standby slave server.
 | |
| 
 | |
| In versions 2.0 and older, the session command history is enabled by default.
 | |
| In version 2.1 and 2.2, the session command history is disabled by default.  In
 | |
| 2.3 and newer versions, the session command is enabled but it is limited to a
 | |
| default of 50 session commands after which the history is disabled.
 | |
| 
 | |
| ```
 | |
| # Disable the session command history
 | |
| disable_sescmd_history=true
 | |
| ```
 | |
| 
 | |
| ### `prune_sescmd_history`
 | |
| 
 | |
| This option prunes the session command history when it exceeds the value
 | |
| configured in `max_sescmd_history`. When this option is enabled, only a set
 | |
| number of statements are stored in the history. This limits the per-session
 | |
| memory use while still allowing safe reconnections. This parameter was added in
 | |
| MaxScale 2.3.4 and is disabled by default.
 | |
| 
 | |
| This parameter is intended to be used with pooled connections that remain in use
 | |
| for a very long time. Most connection pool implementations do not reset the
 | |
| session state and instead re-initialize it with new values. This causes the
 | |
| session command history to grow at roughly a constant rate for the lifetime of
 | |
| the pooled connection.
 | |
| 
 | |
| Each client-side session that uses a pooled connection only executes a finite
 | |
| amount of session commands. By retaining a shorter history that encompasses all
 | |
| session commands the individual clients execute, the session state of a pooled
 | |
| connection can be accurately recreated on another server.
 | |
| 
 | |
| If the session command history pruning is enabled, there is a theoretical
 | |
| possibility that upon server reconnection the session states of the connections
 | |
| are inconsistent. This can only happen if the length of the stored history is
 | |
| shorter than the list of relevant statements that affect the session state. In
 | |
| practice the default value of 50 session commands is a fairly reasonable value
 | |
| and the risk of inconsistent session state is relatively low.
 | |
| 
 | |
| In case the default history length is too short for safe pruning, set the value
 | |
| of `max_sescmd_history` to the total number of commands that affect the session
 | |
| state plus a safety margin of 10. The safety margin reserves some extra space
 | |
| for new commands that might be executed due to changes in the client side
 | |
| application.
 | |
| 
 | |
| ### `master_accept_reads`
 | |
| 
 | |
| **`master_accept_reads`** allows the master server to be used for reads. This is
 | |
| a useful option to enable if you are using a small number of servers and wish to
 | |
| use the master for reads as well.
 | |
| 
 | |
| By default, no reads are sent to the master.
 | |
| 
 | |
| ```
 | |
| # Use the master for reads
 | |
| master_accept_reads=true
 | |
| ```
 | |
| 
 | |
| ### `strict_multi_stmt`
 | |
| 
 | |
| This option is disabled by default since MaxScale 2.2.1. In older versions, this
 | |
| option was enabled by default.
 | |
| 
 | |
| When a client executes a multi-statement query, it will be treated as if it were
 | |
| a DML statement and routed to the master. If the option is enabled, all queries
 | |
| after a multi-statement query will be routed to the master to guarantee a
 | |
| consistent session state.
 | |
| 
 | |
| If the feature is disabled, queries are routed normally after a multi-statement
 | |
| query.
 | |
| 
 | |
| **Warning:** Enable the strict mode only if you know that the clients will send
 | |
|   statements that cause inconsistencies in the session state.
 | |
| 
 | |
| ```
 | |
| # Enable strict multi-statement mode
 | |
| strict_multi_stmt=true
 | |
| ```
 | |
| 
 | |
| ### `strict_sp_calls`
 | |
| 
 | |
| Similar to `strict_multi_stmt`, this option allows all queries after a CALL
 | |
| operation on a stored procedure to be routed to the master. This option is
 | |
| disabled by default and was added in MaxScale 2.1.9.
 | |
| 
 | |
| All warnings and restrictions that apply to `strict_multi_stmt` also apply to
 | |
| `strict_sp_calls`.
 | |
| 
 | |
| ### `master_failure_mode`
 | |
| 
 | |
| This option controls how the failure of a master server is handled. By default,
 | |
| the router will close the client connection as soon as the master is lost.
 | |
| 
 | |
| The following table describes the values for this option and how they treat the
 | |
| loss of a master server.
 | |
| 
 | |
| | Value        | Description|
 | |
| |--------------|-----------|
 | |
| |fail_instantly | When the failure of the master server is detected, the connection will be closed immediately.|
 | |
| |fail_on_write | The client connection is closed if a write query is received when no master is available.|
 | |
| |error_on_write | If no master is available and a write query is received, an error is returned stating that the connection is in read-only mode.|
 | |
| 
 | |
| These also apply to new sessions created after the master has failed. This means
 | |
| that in _fail_on_write_ or _error_on_write_ mode, connections are accepted as
 | |
| long as slave servers are available.
 | |
| 
 | |
| **Note:** If _master_failure_mode_ is set to _error_on_write_ and the connection
 | |
| to the master is lost, by default, clients will not be able to execute write
 | |
| queries without reconnecting to MariaDB MaxScale once a new master is
 | |
| available. If [`master_reconnection`](#master_reconnection) is enabled, the
 | |
| session can recover if one of the slaves is promoted as the master.
 | |
| 
 | |
| ### `retry_failed_reads`
 | |
| 
 | |
| This option controls whether autocommit selects are retried in case of failure.
 | |
| This option is enabled by default.
 | |
| 
 | |
| When a simple autocommit select is being executed outside of a transaction and
 | |
| the slave server where the query is being executed fails, readwritesplit can
 | |
| retry the read on a replacement server. This makes the failure of a slave
 | |
| transparent to the client.
 | |
| 
 | |
| ### `delayed_retry`
 | |
| 
 | |
| Retry queries over a period of time. This parameter takes a boolean value, was
 | |
| added in Maxscale 2.3.0 and is disabled by default.
 | |
| 
 | |
| When this feature is enabled, a failure to route a query due to a connection
 | |
| problem will not immediately result in an error. The routing of the query is
 | |
| delayed until either a valid candidate server is available or the retry timeout
 | |
| is reached. If a candidate server becomes available before the timeout is
 | |
| reached, the query is routed normally and no connection error is returned. If no
 | |
| candidates are found and the timeout is exceeded, the router returns to normal
 | |
| behavior and returns an error.
 | |
| 
 | |
| When combined with the `master_reconnection` parameter, failures of writes done
 | |
| outside of transactions can be hidden from the client connection. This allows a
 | |
| master to be replaced while a write is in progress.
 | |
| 
 | |
| The delayed query retrying mode in readwritesplit does not do any sort of
 | |
| duplicate write detection. To prevent accidental data duplication, it is highly
 | |
| recommended to tune the monitor timeouts to values that produce accurate
 | |
| results.
 | |
| 
 | |
| Duplicate execution of a statement can occur if the connection to the server is
 | |
| lost or the server crashes but the server comes back up before the timeout for
 | |
| the retrying is exceeded. At this point, if the server managed to read the
 | |
| client's statement, it will be executed. For this reason, it is recommended to
 | |
| only enable `delayed_retry` when the possibility of duplicate statement
 | |
| execution is an acceptable risk.
 | |
| 
 | |
| ### `delayed_retry_timeout`
 | |
| 
 | |
| The number of seconds to wait until an error is returned to the client when
 | |
| `delayed_retry` is enabled. The default value is 10 seconds.
 | |
| 
 | |
| ### `transaction_replay`
 | |
| 
 | |
| Replay interrupted transactions. This parameter was added in MaxScale 2.3.0 and
 | |
| is disabled by default. Enabling this parameter implicitly enables both the
 | |
| `delayed_retry` and `master_reconnection` parameters.
 | |
| 
 | |
| When the server where the transaction is in progress fails, readwritesplit can
 | |
| migrate the transaction to a replacement server. This can completely hide the
 | |
| failure of a master node without any visible effects to the client.
 | |
| 
 | |
| If no replacement node becomes available before the timeout controlled by
 | |
| `delayed_retry_timeout` is exceeded, the client connection is closed.
 | |
| 
 | |
| If the results from the replacement server are not identical when the
 | |
| transaction is replayed, the client connection is closed. This means that any
 | |
| transaction with a server specific result (e.g. `NOW()`, `@@server_id`) cannot
 | |
| be replayed successfully but it will still be attempted.
 | |
| 
 | |
| ### `transaction_replay_max_size`
 | |
| 
 | |
| The limit on transaction size for transaction replay in bytes. Any transaction
 | |
| that exceeds this limit will not be replayed. The default value is 1 MiB. This
 | |
| limit applies at a session level which means that the total peak memory
 | |
| consumption can be `transaction_replay_max_size` times the number of client
 | |
| connections.
 | |
| 
 | |
| The amount of memory needed to store a particular transaction will be slightly
 | |
| larger than the length in bytes of the SQL used in the transaction. If the limit
 | |
| is ever exceeded, a message will be logged at the info level.
 | |
| 
 | |
| Read [the configuration guide](../Getting-Started/Configuration-Guide.md#sizes)
 | |
| for more details on size type parameters in MaxScale.
 | |
| 
 | |
| ### `optimistic_trx`
 | |
| 
 | |
| Enable optimistic transaction execution. This parameter controls whether normal
 | |
| transactions (i.e. `START TRANSACTION` or `BEGIN`) are load balanced across
 | |
| slaves. This feature is disabled by default and enabling it implicitly enables
 | |
| `transaction_replay`, `delayed_retry` and `master_reconnection` parameters.
 | |
| 
 | |
| When this mode is enabled, all transactions are first attempted on slave
 | |
| servers. If the transaction contains no statements that modify data, it is
 | |
| completed on the slave. If the transaction contains statements that modify data,
 | |
| it is rolled back on the slave server and restarted on the master. The rollback
 | |
| is initiated the moment a data modifying statement is intercepted by
 | |
| readwritesplit so only read-only statements are executed on slave servers.
 | |
| 
 | |
| As with `transaction_replay` and transactions that are replayed, if the results
 | |
| returned by the master server are not identical to the ones returned by the
 | |
| slave up to the point where the first data modifying statement was executed, the
 | |
| connection is closed. If the execution of ROLLBACK statement on the slave fails,
 | |
| the connection to that slave is closed.
 | |
| 
 | |
| All limitations that apply to `transaction_replay` also apply to
 | |
| `optimistic_trx`.
 | |
| 
 | |
| ### `causal_reads`
 | |
| 
 | |
| Enable causal reads. This parameter is disabled by default and was introduced in
 | |
| MaxScale 2.3.0.
 | |
| 
 | |
| If a client connection modifies the database and `causal_reads` is enabled, any
 | |
| subsequent reads performed on slave servers will be done in a manner that
 | |
| prevents replication lag from affecting the results. This only applies to the
 | |
| modifications done by the client itself.
 | |
| 
 | |
| **Note:** This feature requires MariaDB 10.2.16 or newer to function. In
 | |
|   addition to this, the `session_track_system_variables` parameter must be set
 | |
|   to `last_gtid`.
 | |
| 
 | |
| **Note:** This feature does not work with Galera or any other non-standard
 | |
|   replication mechanisms. As Galera does not update the `gtid_slave_pos`
 | |
|   variable when events are replicated via the Galera library, the
 | |
|   [`MASTER_GTID_WAIT`](https://mariadb.com/kb/en/library/master_gtid_wait/)
 | |
|   function used by MaxScale to synchronize reads will wait until the
 | |
|   timeout. With Galera this is not a serious issue as it, by nature, is a
 | |
|   mostly-synchronous replication mechanism.
 | |
| 
 | |
| A practical example can be given by the following set of SQL commands executed
 | |
| with `autocommit=1`.
 | |
| 
 | |
| ```sql
 | |
| INSERT INTO test.t1 (id) VALUES (1);
 | |
| SELECT * FROM test.t1 WHERE id = 1;
 | |
| ```
 | |
| 
 | |
| As the statements are not executed inside a transaction, from the load balancers
 | |
| point of view, the latter statement can be routed to a slave server. The problem
 | |
| with this is that if the value that was inserted on the master has not yet
 | |
| replicated to the server where the SELECT statement is being performed, it can
 | |
| appear as if the value we just inserted is not there.
 | |
| 
 | |
| By prefixing these types of SELECT statements with a command that guarantees
 | |
| consistent results for the reads, read scalability can be improved without
 | |
| sacrificing consistency.
 | |
| 
 | |
| The set of example SQL above will be translated by MaxScale into the following
 | |
| statements.
 | |
| 
 | |
| ```sql
 | |
| INSERT INTO test.t1 (id) VALUES (1);
 | |
| SET @maxscale_secret_variable=(
 | |
|     SELECT CASE
 | |
|            WHEN MASTER_GTID_WAIT('0-3000-8', 10) = 0 THEN 1
 | |
|            ELSE (SELECT 1 FROM INFORMATION_SCHEMA.ENGINES)
 | |
|     END);
 | |
| SELECT * FROM test.t1 WHERE id = 1;
 | |
| ```
 | |
| 
 | |
| The `SET` command will synchronize the slave to a certain logical point in
 | |
| the replication stream (see
 | |
| [MASTER_GTID_WAIT](https://mariadb.com/kb/en/library/master_gtid_wait/)
 | |
| for more details).
 | |
| 
 | |
| If the slave has not caught up to the master within the configured time, it will
 | |
| be retried on the master. In MaxScale 2.3.0 an error was returned to the client
 | |
| when the slave timed out.
 | |
| 
 | |
| ### `causal_reads_timeout`
 | |
| 
 | |
| The timeout for the slave synchronization done by `causal_reads`. The
 | |
| default value is 10 seconds.
 | |
| 
 | |
| ## Routing hints
 | |
| 
 | |
| The readwritesplit router supports routing hints. For a detailed guide on hint
 | |
| syntax and functionality, please read [this](../Reference/Hint-Syntax.md)
 | |
| document.
 | |
| 
 | |
| **Note**: Routing hints will always have the highest priority when a routing
 | |
| decision is made. This means that it is possible to cause inconsistencies in
 | |
| the session state and the actual data in the database by adding routing hints
 | |
| to DDL/DML statements which are then directed to slave servers. Only use routing
 | |
| hints when you are sure that they can cause no harm.
 | |
| 
 | |
| ## Limitations
 | |
| 
 | |
| For a list of readwritesplit limitations, please read the
 | |
| [Limitations](../About/Limitations.md) document.
 | |
| 
 | |
| ## Legacy Configuration
 | |
| 
 | |
| In older versions of MaxScale, routers were configured via the _router_options_
 | |
| parameter. This functionality was deprecated in 2.2 and was removed in 2.3.
 | |
| 
 | |
| ## Examples
 | |
| 
 | |
| Examples of the readwritesplit router in use can be found in the
 | |
| [Tutorials](../Tutorials) folder.
 | |
| 
 | |
| ## Readwritesplit routing decisions
 | |
| 
 | |
| Here is a small explanation which shows what kinds of queries are routed to
 | |
| which type of server.
 | |
| 
 | |
| ### Routing to Master
 | |
| 
 | |
| Routing to master is important for data consistency and because majority of
 | |
| writes are written to binlog and thus become replicated to slaves.
 | |
| 
 | |
| The following operations are routed to master:
 | |
| 
 | |
| * write statements,
 | |
| * all statements within an open transaction,
 | |
| * stored procedure calls
 | |
| * user-defined function calls
 | |
| * DDL statements (`DROP`|`CREATE`|`ALTER TABLE` … etc.)
 | |
| * `EXECUTE` (prepared) statements that modify the database
 | |
| * all statements using temporary tables
 | |
| 
 | |
| In addition to these, if the **readwritesplit** service is configured with the
 | |
| `max_slave_replication_lag` parameter, and if all slaves suffer from too much
 | |
| replication lag, then statements will be routed to the _Master_. (There might be
 | |
| other similar configuration parameters in the future which limit the number of
 | |
| statements that will be routed to slaves.)
 | |
| 
 | |
| ### Routing to Slaves
 | |
| 
 | |
| The ability to route some statements to slaves is important because it also
 | |
| decreases the load targeted to master. Moreover, it is possible to have multiple
 | |
| slaves to share the load in contrast to single master.
 | |
| 
 | |
| Queries which can be routed to slaves must be auto committed and belong to one
 | |
| of the following group:
 | |
| 
 | |
| * read-only database queries,
 | |
| * read-only queries to system, or user-defined variables,
 | |
| * `SHOW` statements
 | |
| * system function calls.
 | |
| 
 | |
| ### Routing to every session backend
 | |
| 
 | |
| A third class of statements includes those which modify session data, such as
 | |
| session system variables, user-defined variables, the default database, etc. We
 | |
| call them session commands, and they must be replicated as they affect the
 | |
| future results of read and write operations. They must be executed on all
 | |
| servers that could execute statements on behalf of this client.
 | |
| 
 | |
| Session commands include for example:
 | |
| 
 | |
| * `SET` statements
 | |
| * `USE `*`<dbname>`*
 | |
| * system/user-defined variable assignments embedded in read-only statements, such
 | |
| as `SELECT (@myvar := 5)`
 | |
| * `PREPARE` statements
 | |
| * `QUIT`, `PING`, `STMT RESET`, `CHANGE USER`, etc. commands
 | |
| 
 | |
| **NOTE**: if variable assignment is embedded in a write statement it is routed
 | |
| to _Master_ only. For example, `INSERT INTO t1 values(@myvar:=5, 7)` would be
 | |
| routed to _Master_ only.
 | |
| 
 | |
| The router stores all of the executed session commands so that in case of a
 | |
| slave failure, a replacement slave can be chosen and the session command history
 | |
| can be repeated on that new slave. This means that the router stores each
 | |
| executed session command for the duration of the session. Applications that use
 | |
| long-running sessions might cause MariaDB MaxScale to consume a growing amount
 | |
| of memory unless the sessions are closed. This can be solved by adjusting the
 | |
| value of `max_sescmd_history`.
 |