Threading Client¶
The threading client is a high-performance client that uses Python’s built-in threading module to manage concurrency.
Tutorial¶
In a separate shell, start one of caproto’s demo IOCs.
$ python3 -m caproto.ioc_examples.random_walk
PVs: ['random_walk:dt', 'random_walk:x']
Connect¶
Now, in Python we will talk to it using caproto’s threading client. Start by
creating a Context
.
In [1]: from caproto.threading.client import Context
In [2]: ctx = Context()
The Context
object caches connections, manages automatic
re-connection, and tracks the state of connections in progress.
We can use it to request new connections. Formulating requests for many PVs in
a large batch is efficient. In this example we’ll just ask for two PVs.
In [3]: x, dt = ctx.get_pvs('random_walk:x', 'random_walk:dt')
Context.get_pvs()
accepts an arbitrary number of PV names and
immediately returns a collection of PV
objects representing each name.
In a background thread, the Context searches for an EPICS server that provides
that PV name and then connects to it. The PV object displays its connection
state:
In [4]: dt
Out[4]: <PV name='random_walk:dt' priority=0 address=('192.168.2.99', 51648), circuit_state=States.CONNECTED, channel_state=States.CONNECTED>
The Context displays aggregate information about the state of all connections.
In [5]: ctx
Out[5]: <Context searches_pending=0 circuits=1 pvs=2 idle=0>
Read¶
Now, to read a PV:
In [6]: res = dt.read()
In [7]: res
Out[7]: ReadNotifyResponse(data=array([1.]), data_type=<ChannelType.DOUBLE: 6>, data_count=1, status=CAStatusCode(name='ECA_NORMAL', code=0, code_with_severity=1, severity=<CASeverity.SUCCESS: 1>, success=1, defunct=False, description='Normal successful completion'), ioid=0, metadata=None)
This object is a human-friendly representation of the server’s response. The raw bytes of that response are:
In [8]: bytes(res)
Out[8]: b'\x00\x0f\x00\x08\x00\x06\x00\x01\x00\x00\x00\x01\x00\x00\x00\x00?\xf0\x00\x00\x00\x00\x00\x00'
Access particular fields in the response using attribute (“dot”) access on res
.
In [9]: res.data
Out[9]: array([1.])
Note
Performance Note
The underlying metadata and data are stored in efficient, contiguous-memory data structures.
In [10]: res.header # a ctypes.BigEndianStructure Out[10]: MessageHeader(command=15, payload_size=8, data_type=6, data_count=1, parameter1=1, parameter2=0) In [11]: res.buffers # a collection of one or more buffers Out[11]: (b'', <memory at 0x7fef90722708>)
They were received directly from the socket into these structure with no
intermediate copies. Accessing the res.data
— which returns a
numpy.ndarray
or array.array
— provides a view onto that same
memory with no copying (if the data was received from the socket all at
once) or one copy (if the data bridged multiple receipts).
Write¶
Let us set the value to 1
.
In [12]: dt.write([1])
Out[12]: WriteNotifyResponse(data_type=<ChannelType.DOUBLE: 6>, data_count=1, status=CAStatusCode(name='ECA_NORMAL', code=0, code_with_severity=1, severity=<CASeverity.SUCCESS: 1>, success=1, defunct=False, description='Normal successful completion'), ioid=1)
By default, we send WriteNotifyResponse
, wait for a response, and return
it. There are a couple other ways we can handle writes:
Return immediately, not asking for or waiting for a response.
dt.write([1], wait=False)
Return immediately, not waiting for a response, but handing the response (when it arrived) to some callback function, processed on a background thread.
def f(response): print('got a response:', response) dt.write([1], wait=False, callback=f)
See the PV.write()
for more.
Subscribe (“Monitor”)¶
Let us now monitor a channel. The server updates the random_walk:x
channel
periodically at some period set by random_walk:dt
. We can subscribe to
updates and fan them out to one or more user-defined callback functions.
First, we define a Subscription
.
In [13]: sub = x.subscribe()
Next, we define a callback function, a function that will be called whenever the server sends an update. It must accept one positional argument.
In [14]: responses = []
In [15]: def f(response):
....: responses.append(response)
....:
We register this function with sub
.
In [16]: token = sub.add_callback(f)
The token
is just an integer which we can use to remove f
later. We can
define a second callback:
In [17]: values = []
In [18]: def g(response):
....: values.append(response.data[0])
....:
and add it to the same subscription, putting no additional load on the network.
In [19]: sub.add_callback(g)
Out[19]: 1
After some time has passed, we will have accumulated some responses.
In [20]: len(responses) Out[20]: 6 In [21]: values Out[21]: [1.1281205425353997, 1.6860106441088352, 0.8072963150936638, 0.052539181626817255, -0.09686981917873716, -0.1663071637826916]
At any point we can remove a specific callback function:
In [22]: sub.remove_callback(token)
or clear all the callbacks on a subscription:
In [23]: sub.clear()
In order to minimize load on the network, a Subscription
waits to
request updates from the server until the first user callback is added. Thus,
the first callback added by the user is guaranteed to get the first response
received from the server. If all user callbacks are later removed, either
explicitly (via remove_callback
or clear
) or implicitly via Python
garbage collection, the Subscription automatically cancels future updates from
the server. If a callback is then later added, the Subscription silently
re-initiates updates. All of this is transparent to the user.
Warning
The callback registry in Subscription
only holds weak references
to the user callback functions. If there are no other references to the
function, it will be silently garbage collected and removed. Therefore,
constructions like this do not work:
sub.add_callback(lambda response: print(response.data))
The lambda function will be promptly garbage collected by Python and
removed from sub
by caproto. To avoid that, make a reference before
passing the function to Subscription.add_callback()
.
cb = lambda response: print(response.data)
sub.add_callback(cb)
This can be surprising, but it is a standard approach for avoiding the accidental costly accumulation of abandoned callbacks.
This pitfall does not apply to callbacks passed to PV.read()
and
PV.write()
(or Batch.read()
and Batch.write()
) because
those are single-shot callbacks that do not persist beyond their first use.
Batched Requests¶
Batching requests is efficient, and it sets up the server to perform these
operations as closely-spaced in time as possible, within the limits of the
protocol. Python’s with
syntax provides a natural way to specify a batch of
requests — reads, writes, or a mixture of both — and execute them upon exit
from the with
block.
Suppose we had a list of PV
objects, pvs
, and we want to request
readings in bulk. We can use a callback function to stash readings in a
dictionary as they arrive.
For convenience we’ll demonstrate this using the two PVs we have handy —
x
and dt
— but an unlimited number may be used.
In [24]: pvs = [x, dt]
In [25]: from functools import partial
In [26]: results = {}
In [27]: def stash_result(name, response):
....: results[name] = response.data
....:
Now we’ll use the Batch
context:
In [28]: from caproto.threading.client import Batch
In [29]: with Batch() as b:
....: for pv in pvs:
....: b.read(pv, partial(stash_result, pv.name))
....:
The requests will be sent in large batches (over the PVs’ respective circuits)
upon exiting the with
block. The responses will be processed on a
background thread. As the responses come in, the results
dictionary will be
updated.
In [30]: results
Out[30]: {'random_walk:x': array([-0.16630716]), 'random_walk:dt': array([1.])}
See Batch
for more.
Go Idle¶
Once created, PVs are cached for the lifetime of the Context
and
returned again to the user if a PV with the same name and priority is
requested. In order to reduce the load on the network, a PV can be temporarily
made “idle” (disconnected). It will silently, automatically reconnect the next
time it is used.
In [31]: x Out[31]: <PV name='random_walk:x' priority=0 address=('192.168.2.99', 51648), circuit_state=States.CONNECTED, channel_state=States.CONNECTED> In [32]: x.go_idle() In [33]: x Out[33]: <PV name='random_walk:x' priority=0 (idle)> In [34]: x.read() Out[34]: ReadNotifyResponse(data=array([0.12345886]), data_type=<ChannelType.DOUBLE: 6>, data_count=1, status=CAStatusCode(name='ECA_NORMAL', code=0, code_with_severity=1, severity=<CASeverity.SUCCESS: 1>, success=1, defunct=False, description='Normal successful completion'), ioid=4, metadata=None) In [35]: x Out[35]: <PV name='random_walk:x' priority=0 address=('192.168.2.99', 51648), circuit_state=States.CONNECTED, channel_state=States.CONNECTED>
Notice that when the PV was read it automatically reconnected, requiring no action from the user.
The go_idle()
method is merely a request and is not guaranteed to have
any effect. If a PV has active subscriptions, it will ignore the request: it
must stay active to continue servicing user callbacks. Therefore, it is safe
call go_idle()
on any PV at any time, knowing that the PV will decline to
disconnect if it is being actively used and that, if it is currently unused, it
will transparently reconnect the next time it is used.
Canceling Searches¶
All unanswered searches are retried repeatedly, with decreasing frequency,
forever. Each new call to get_pvs()
causes all unanswered
searches to be retried at least once immediately. This can generate unwanted
network traffic. To fully cancel a search that is never expected to complete,
access the method SharedBroadcaster.cancel
.
ctx.broadcaster.cancel('some typo-ed PV name, for example')
As the name suggests, it is possible to construct multiple Contexts that share one SharedBroadcaster. In that scenario, notice that canceling the search will affect all contexts using the SharedBroadcaster.
Events Off and On¶
If a given circuit produces updates faster than a client can process them, the client can suspend subscriptions on that circuit. This will causes the server to discard all backlogged updates and all new updates during the period of supsension. When the client reactives subscriptions, it will immediate receive the most recent update and then any future updates.
x.circuit_manager.events_off()
...
x.circuit_manager.events_on()
Server Health Check¶
To check how much time has passed (in seconds) since each known server was last heard from, use:
ctx.broadcaster.time_since_last_heard()
As a convenience, check on the server connected to a specific PV using:
x.time_since_last_heard()
See the SharedBroadcaster.time_since_last_heard()
API documentation below
for details.
API Documentation¶
-
class
caproto.threading.client.
Context
(broadcaster=None, *, timeout=2, host_name=None, client_name=None, max_workers=1)[source]¶ Encapsulates the state and connections of a client
Parameters: - broadcaster : SharedBroadcaster, optional
If None is specified, a fresh one is instantiated.
- timeout : number or None, optional
Number of seconds before a CaprotoTimeoutError is raised. This default can be overridden at the PV level or for any given operation. If unset, the default is 2 seconds. If None, never timeout. A global timeout can be specified via an environment variable
CAPROTO_DEFAULT_TIMEOUT
.- host_name : string, optional
uses value of
socket.gethostname()
by default- client_name : string, optional
uses value of
getpass.getuser()
by default- max_workers : integer, optional
Number of worker threaders per VirtualCircuit for executing user callbacks. Default is 1. For any number of workers, workers will receive updates in the order which they are received from the server. That is, work on each update will begin in sequential order. Work-scheduling internal to the user callback is outside caproto’s control. If the number of workers is set to greater than 1, the work on each update may not finish in a deterministic order. For example, if workers are writing lines into a file, the only way to guarantee that the lines are ordered properly is to use only one worker. If ordering matters for your application, think carefully before increasing this value from 1.
-
get_pvs
(*names, priority=0, connection_state_callback=None, access_rights_callback=None, timeout=<object object>)[source]¶ Return a list of PV objects.
These objects may not be connected at first. Channel creation occurs on a background thread.
PVs are uniquely defined by their name and priority. If a PV with the same name and priority is requested twice, the same (cached) object is returned. Any callbacks included here are added to added alongside any existing ones.
Parameters: - *names : strings
any number of PV names
- priority : integer
Used by the server to triage subscription responses when under high load. 0 is lowest; 99 is highest.
- connection_state_callback : callable
Expected signature:
f(pv, state)
wherepv
is the instance ofPV
whose state has changed andstate
is a string- access_rights_callback : callable
Expected signature:
f(pv, access_rights)
wherepv
is the instance ofPV
whose state has changed andaccess_rights
is a member of the caprotoAccessRights
enum- timeout : number or None, optional
Number of seconds before a CaprotoTimeoutError is raised. This default can be overridden for any specific operation. By default, fall back to the default timeout set by the Context. If None, never timeout.
-
class
caproto.threading.client.
PV
(name, priority, context, connection_state_callback, access_rights_callback, timeout)[source]¶ Represents one PV, specified by a name and priority.
This object may exist prior to connection and persists across any subsequent re-connections.
This object should never be instantiated directly by user code; rather it should be created by calling the
get_pvs
method on aContext
object.-
go_idle
()[source]¶ Request to clear this Channel to reduce load on client and server.
A new Channel will be automatically, silently created the next time any method requiring a connection is called. Thus, this saves some memory in exchange for making the next request a bit slower, as it has to redo the handshake with the server first.
If there are any subscriptions with callbacks, this request will be ignored. If the PV is in the process of connecting, this request will be ignored. If there are any actions in progress (read, write) this request will be processed when they are complete.
-
read
(*, wait=True, callback=None, timeout=<object object>, data_type=None, data_count=None, notify=True)[source]¶ Request a fresh reading.
Can do one or both of: - Block while waiting for the response, and return it. - Pass the response to callback, with or without blocking.
Parameters: - wait : boolean
If True (default) block until a matching response is received from the server. Raises CaprotoTimeoutError if that response is not received within the time specified by the timeout parameter.
- callback : callable or None
Called with the response as its argument when received.
- timeout : number or None, optional
Seconds to wait before a CaprotoTimeoutError is raised. Default is
PV.timeout
, which falls back toPV.context.timeout
if not set. If None, never timeout.- data_type : {‘native’, ‘status’, ‘time’, ‘graphic’, ‘control’} or ChannelType or int ID, optional
Request specific data type or a class of data types, matched to the channel’s native data type. Default is Channel’s native data type.
- data_count : integer, optional
Requested number of values. Default is the channel’s native data count.
- notify: boolean, optional
Send a
ReadNotifyRequest
instead of aReadRequest
. True by default.
-
subscribe
(data_type=None, data_count=None, low=0.0, high=0.0, to=0.0, mask=None)[source]¶ Start a new subscription to which user callback may be added.
Parameters: - data_type : {‘native’, ‘status’, ‘time’, ‘graphic’, ‘control’} or ChannelType or int ID, optional
Request specific data type or a class of data types, matched to the channel’s native data type. Default is Channel’s native data type.
- data_count : integer, optional
Requested number of values. Default is the channel’s native data count.
- low, high, to : float, optional
deprecated by Channel Access, not yet implemented by caproto
- mask : SubscriptionType, optional
Subscribe to selective updates.
Returns: - subscription : Subscription
Examples
Define a subscription.
>>> sub = pv.subscribe()
Add a user callback. The subscription will be transparently activated (i.e. an
EventAddRequest
will be sent) when the first user callback is added.>>> sub.add_callback(my_func)
Multiple callbacks may be added to the same subscription.
>>> sub.add_callback(another_func)
See the docstring for
Subscription
for more.
-
time_since_last_heard
(timeout=<object object>)[source]¶ Seconds since last message from the server that provides this channel.
The time is reset to 0 whenever we receive a TCP message related to user activity or a Beacon. Servers are expected to send Beacons at regular intervals. If we do not receive either a Beacon or TCP message, we initiate an Echo over TCP, to which the server is expected to promptly respond.
Therefore, the time reported here should not much exceed
EPICS_CA_CONN_TMO
(default 30 seconds unless overriden by that environment variable) if the server is healthy.If the server fails to send a Beacon on schedule and fails to reply to an Echo, the server is assumed dead. A warning is issued, and all PVs are disconnected to initiate a reconnection attempt.
Parameters: - timeout : number or None, optional
Seconds to wait before a CaprotoTimeoutError is raised. Default is
PV.timeout
, which falls back toPV.context.timeout
if not set. If None, never timeout.
-
timeout
¶ Effective default timeout.
Valid values are: * CONTEXT_DEFAULT_TIMEOUT (fall back to Context.timeout) * a floating-point number * None (never timeout)
-
wait_for_connection
(*, timeout=<object object>)[source]¶ Wait for this PV to be connected.
Parameters: - timeout : number or None, optional
Seconds to wait before a CaprotoTimeoutError is raised. Default is
PV.timeout
, which falls back toPV.context.timeout
if not set. If None, never timeout.
-
wait_for_search
(*, timeout=<object object>)[source]¶ Wait for this PV to be found.
This does not wait for the PV’s Channel to be created; it merely waits for an address (and a VirtualCircuit) to be assigned.
Parameters: - timeout : number or None, optional
Seconds to wait before a CaprotoTimeoutError is raised. Default is
PV.timeout
, which falls back to Context.timeout if not set. If None, never timeout.
-
write
(data, *, wait=True, callback=None, timeout=<object object>, notify=None, data_type=None, data_count=None)[source]¶ Write a new value. Optionally, request confirmation from the server.
Can do one or both of: - Block while waiting for the response, and return it. - Pass the response to callback, with or without blocking.
Parameters: - data : str, int, or float or any Iterable of these
Value(s) to write.
- wait : boolean
If True (default) block until a matching WriteNotifyResponse is received from the server. Raises CaprotoTimeoutError if that response is not received within the time specified by the timeout parameter.
- callback : callable or None
Called with the WriteNotifyResponse as its argument when received.
- timeout : number or None, optional
Seconds to wait before a CaprotoTimeoutError is raised. Default is
PV.timeout
, which falls back toPV.context.timeout
if not set. If None, never timeout.- notify : boolean or None
If None (default), set to True if wait=True or callback is set. Can be manually set to True or False. Will raise ValueError if set to False while wait=True or callback is set.
- data_type : {‘native’, ‘status’, ‘time’, ‘graphic’, ‘control’} or ChannelType or int ID, optional
Write specific data type or a class of data types, matched to the channel’s native data type. Default is Channel’s native data type.
- data_count : integer, optional
Requested number of values. Default is the channel’s native data count.
-
-
class
caproto.threading.client.
Subscription
(pv, data_type, data_count, low, high, to, mask)[source]¶ Represents one subscription, specified by a PV and configurational parameters
It may fan out to zero, one, or multiple user-registered callback functions.
This object should never be instantiated directly by user code; rather it should be made by calling the
subscribe()
method on aPV
object.-
add_callback
(func)[source]¶ Add a callback to receive responses.
Parameters: - func : callable
Expected signature:
func(response)
Returns: - token : int
Integer token that can be passed to
remove_callback()
.
-
remove_callback
(token)[source]¶ Remove callback using token that was returned by
add_callback()
.Parameters: - token : integer
Token returned by
add_callback()
.
-
-
class
caproto.threading.client.
Batch
(timeout=2)[source]¶ Accumulate requests and then issue them all in batch.
Parameters: - timeout : number or None
Number of seconds to wait before ignoring late responses. Default is 2.
Examples
Read some PVs in batch and stash the readings in a dictionary as they come in.
>>> results = {} >>> def stash_result(name, response): ... results[name] = response.data ... >>> with Batch() as b: ... for pv in pvs: ... b.read(pv, functools.partial(stash_result, pv.name)) ... # The requests are sent upon exiting this 'with' block. ...
The
results
dictionary will be populated as responses come in.-
read
(pv, callback, data_type=None, data_count=None)[source]¶ Request a fresh reading as part of a batched request.
Notice that, unlike
PV.read()
, the callback is required. (There is no other way to get the result back from a batched read.)Parameters: - pv : PV
- callback : callable
Expected signature:
f(response)
- data_type : {‘native’, ‘status’, ‘time’, ‘graphic’, ‘control’} or ChannelType or int ID, optional
Request specific data type or a class of data types, matched to the channel’s native data type. Default is Channel’s native data type.
- data_count : integer, optional
Requested number of values. Default is the channel’s native data count.
-
write
(pv, data, callback=None, data_type=None, data_count=None)[source]¶ Write a new value as part of a batched request.
Parameters: - pv : PV
- data : str, int, or float or any Iterable of these
Value(s) to write.
- callback : callable
Expected signature:
f(response)
- data_type : {‘native’, ‘status’, ‘time’, ‘graphic’, ‘control’} or ChannelType or int ID, optional
Request specific data type or a class of data types, matched to the channel’s native data type. Default is Channel’s native data type.
- data_count : integer, optional
Requested number of values. Default is the channel’s native data count.
The following are internal components. There API may change in the future.
-
class
caproto.threading.client.
VirtualCircuitManager
(context, circuit, selector, timeout=2)[source]¶ Encapsulates a VirtualCircuit, a TCP socket, and additional state
This object should never be instantiated directly by user code. It is used internally by the Context. Its methods may be touched by user code, but this is rarely necessary.
Cancel searches for these names.
Parameters: - *names : strings
any number of PV names
- Any PV instances that were awaiting these results will be stuck until
- :meth:`get_pvs` is called again.
Returns address if found, raises KeyError if missing or stale.
Receive and process and next command broadcasted over UDP.
Search for PV names.
The
results_queue
will receive(address, names)
(the address of a server and a list of name(s) that it has) when results are received.If a cached result is already known, it will be put immediately into
results_queue
from this thread during this method’s execution.If not, a SearchRequest will be sent from another thread. If necessary, the request will be re-sent periodically. When a matching response is received (by yet another thread)
(address, names)
will be put into theresults_queue
.
Force the Broadcaster to reissue all unanswered search requests now.
Left to its own devices, the Broadcaster will do this at regular intervals automatically. This method is intended primarily for debugging and should not be needed in normal use.
Process a command and transport it over the UDP socket.
Map each known server address to seconds since its last message.
The time is reset to 0 whenever we receive a TCP message related to user activity or a Beacon. Servers are expected to send Beacons at regular intervals. If we do not receive either a Beacon or TCP message, we initiate an Echo over TCP, to which the server is expected to promptly respond.
Therefore, the time reported here should not much exceed
EPICS_CA_CONN_TMO
(default 30 seconds unless overriden by that environment variable) if the server is healthy.If the server fails to send a Beacon on schedule and fails to reply to an Echo, the server is assumed dead. A warning is issued, and all PVs are disconnected to initiate a reconnection attempt.