CS 452/652 Winter 2024 - Lecture 16
Latency
Mar 7, 2024
Latency Analysis
- compute latency (vs. Märklin latency)
- think of program as graph
- computation & termination vs. control/service loop
- control/service loop: throughput vs. latency
- timing of edges: busy vs. block
- throughput: compare average busy cost of paths to offered load / arrival rate
- latency: take into account busy and blocked cost
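As a minimal formalization of the two bullets above (the notation λ and b̄ is assumed here, not from the lecture):

```latex
% throughput: sustainable only if offered load does not exceed capacity
\rho = \lambda \cdot \bar{b} \le 1
% latency along a path counts both busy and blocked time on every edge
L_{\text{path}} = \sum_{e \in \text{path}} \left( \text{busy}_e + \text{blocked}_e \right)
```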
Average Latency?
- lower latencies do not balance out higher ones - averaging across them is misleading
- service loop: outliers change mean - potentially skews utility
- late response is useless, regardless of how late
- control loop: mean masks outliers - potentially hides catastrophic problem
- how often a train enters a critical track section matters more than how much it overshoots
- analysis must work with latency distribution
- service loop: high order percentile (tail latency) - see sketch below
- control loop: worst-case latency
- exceptions where average latency is relevant
- sizing of streaming playout buffer
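A minimal sketch of working with the latency distribution rather than the mean: log latency samples during the run and post-process them (e.g., on the host, since the bare-metal environment has no qsort) to get a high-order percentile with the nearest-rank method. Function names are illustrative, not from the course code.

```c
#include <stdint.h>
#include <stdlib.h>

static int cmp_u32(const void *a, const void *b) {
    uint32_t x = *(const uint32_t *)a, y = *(const uint32_t *)b;
    return (x > y) - (x < y);
}

/* nearest-rank p-th percentile (0 < p <= 100) of n logged samples */
uint32_t latency_percentile(uint32_t *samples, size_t n, unsigned p) {
    qsort(samples, n, sizeof(uint32_t), cmp_u32);
    size_t rank = (n * p + 99) / 100;     /* ceil(n*p/100) */
    return samples[rank ? rank - 1 : 0];
}
```

For a service loop one would report, e.g., latency_percentile(samples, n, 99); for a control loop the maximum sample is the relevant number.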
Worst-Case Latency
- worst-case execution time (WCET)
- identify (safety-)critical paths or worst-case execution path (WCEP)
- high order percentile vs. actual worst-case?
- uncertainty
- busy: memory pipelining/caching/sharing effects?
- cache: no miss, one miss, all misses -> reality?
- block: contributes to latency, but not throughput
- internal loops - bounds?
- queueing
- standing queue - contributes to latency, but does not affect throughput
- queueing theory: latency increases as arrivals approach capacity
- cannot "save" capacity during low-arrival periods for later high-arrival periods
- assumes unbounded arrivals and averages over infinite time - no worst-case guarantee
- alternative model: Critical Instant
- assume all higher-prio events happen at the same time
→ trace execution in priority order to obtain the worst-case response time
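The queueing-theory bullet and the critical-instant model can both be written down concretely (standard textbook results, notation assumed here): for an M/M/1 queue with arrival rate λ and service rate μ, mean time in system blows up as arrivals approach capacity; under the critical-instant assumption, the worst-case response time R_i of a task with cost C_i and higher-priority tasks j (period T_j, cost C_j) is the least fixed point of the response-time recurrence.

```latex
W = \frac{1}{\mu - \lambda} \qquad \text{(M/M/1: grows without bound as } \lambda \to \mu\text{)}

R_i = C_i + \sum_{j \in hp(i)} \left\lceil \frac{R_i}{T_j} \right\rceil C_j
\qquad \text{(iterate from } R_i^{(0)} = C_i \text{ until it converges)}
```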
Automatic Analysis
- static analysis: NP-hard
- experimental measurement: no proof, no worst-case guarantee!
- hybrid: measure single feasible paths (SFP) & analyze combination
- automatic analysis: loop counts, recursion depth? needs bounds/annotations
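A small illustration of why bounds/annotations are needed: without a bound on n, no tool can bound the loop below; the comment plays the role of an annotation a WCET analyzer would require (the bound of 16 is a made-up example).

```c
/* static WCET analysis needs a bound on this loop's iteration count */
int checksum(const unsigned char *buf, int n) {
    int sum = 0;
    /* loop bound: n <= 16, e.g. one reply message is at most 16 bytes */
    for (int i = 0; i < n; i++) {
        sum += buf[i];
    }
    return sum;
}
```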
Train Control
- critical path: sensor event → compute → action → effect
- small tasks: shorter program graph edges
- preemptive scheduling: shortcut to loop start in program graph
- preemption: keep uninterruptible kernel execution short!
- priorities: resource management via scheduling
Task Priorities
- priorities are a mechanism to organize the many components of a computer system by urgency
- critical path: sensor → uart → supervisor → engineer → track → uart → command
- plus requisite notifiers, couriers, etc.
- critical path? everything is important! what can be bumped down?
- computation such as routing and path planning?
- at low utilization: priorities not as critical
- priority determines worst-case latency
- low-level tasks: low priority could lose events or input
- use higher priority and/or buffer
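A sketch of the "buffer" option: input is drained at high priority into a bounded ring buffer so nothing is lost while a lower-priority consumer catches up. Names and the buffer size are assumptions, not from the course code.

```c
#include <stdint.h>
#include <stdbool.h>

#define RX_BUF_SIZE 256                      /* assumed size */

struct rx_buf {
    uint8_t  data[RX_BUF_SIZE];
    unsigned head, tail;                     /* head: next write, tail: next read */
};

/* called at high priority when a byte arrives; false means overflow */
static bool rx_put(struct rx_buf *b, uint8_t c) {
    unsigned next = (b->head + 1) % RX_BUF_SIZE;
    if (next == b->tail) return false;       /* full: event would be lost */
    b->data[b->head] = c;
    b->head = next;
    return true;
}

/* called whenever the lower-priority consumer gets to run */
static bool rx_get(struct rx_buf *b, uint8_t *c) {
    if (b->head == b->tail) return false;    /* empty */
    *c = b->data[b->tail];
    b->tail = (b->tail + 1) % RX_BUF_SIZE;
    return true;
}
```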
Reminder: Train Control Caveat
- track interface is half-duplex
- alternate between reading requested sensor data and sending commands
- train control commands delay sensor loop - options:
- single-bank sensor queries - budget for train commands
- additional overhead for query commands
- alternate between multi-bank sensor query and N train commands (bytes)
- accept uncertainty of sensor timing based on number of previous train commands
- ignore additional error, because it is small?
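A sketch of the "alternate" option: each round sends at most N queued train-command bytes followed by one multi-bank sensor query, so the sensor reply's timing uncertainty is bounded by N command-byte times. The queue/UART helpers are placeholders, and the query byte value should be checked against the Märklin documentation.

```c
#define MAX_CMD_BYTES_PER_ROUND 4            /* N: assumed budget per round */
#define SENSOR_QUERY_ALL 133                 /* 128 + 5: dump all 5 sensor banks (check docs) */

/* placeholders for the real queue and UART code */
struct cmd_queue;
int  queue_empty(struct cmd_queue *q);
unsigned char queue_pop(struct cmd_queue *q);
void put_byte(unsigned char b);

void track_output_round(struct cmd_queue *q) {
    /* 1. drain up to N pending train-command bytes */
    for (int i = 0; i < MAX_CMD_BYTES_PER_ROUND && !queue_empty(q); i++)
        put_byte(queue_pop(q));
    /* 2. one sensor query; its reply is delayed by at most N command bytes,
          which the sensor-timestamp consumer can budget for */
    put_byte(SENSOR_QUERY_ALL);
}
```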
Example: Command Execution Latency
- assume CommandWriter is highest priority server
- latency includes
- SRR for request (includes a CS)
- request write
- Await interrupt (for CTS) - interrupt handling and another CS (unblock)
- variability:
- assume CommandWriter runs immediately after interrupt, because it is highest priority
- otherwise, we need to consider additional delay due to higher priority task(s)
- interrupts might occur while CommandWriter is running?
- can try to estimate/bound number
- this is why we want interrupt handling to be fast and predictable!
- might need to measure primitive steps
- have already measured SRR, including CS
- easy to measure command writing time, by polling/being interrupted by CTS
- measure interrupt overhead?
- in the kernel, using an always-asserted interrupt, count and measure N occurrences
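A sketch of measuring one primitive step: bracket the operation with reads of a free-running microsecond timer and keep min/max as well as the sum, since for latency analysis the maximum matters more than the average. read_timer_us() and the Send() signature are assumptions about the local kernel.

```c
#include <stdint.h>

uint64_t read_timer_us(void);                            /* assumed timer helper */
int Send(int tid, const char *msg, int msglen, char *reply, int rplen);

void measure_send(int server_tid) {
    char msg = 'x', reply;
    uint64_t min = UINT64_MAX, max = 0, sum = 0;
    for (int i = 0; i < 1000; i++) {
        uint64_t t0 = read_timer_us();
        Send(server_tid, &msg, 1, &reply, 1);
        uint64_t t1 = read_timer_us();
        uint64_t d = t1 - t0;
        if (d < min) min = d;
        if (d > max) max = d;
        sum += d;
    }
    /* report min, max, and sum/1000; the max (tail) is what matters here */
}
```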
Example: Sensor Poll
- track monitor sends poll command to CommandWriter, then waits for data from sensor reader, records result in memory
- we've already estimated time for command execution
- after command executes, monitor needs to wait for message from sensor reader
- need to estimate time for sensor reader
- in addition, when sensor message arrives at monitor:
- monitor might be busy handling another request
- other request messages might be waiting ahead of sensor report
- even when data is available, monitor may not be running, depending on priority
- strategy:
- try to estimate best- and worst-case times, using application knowledge
- e.g., at most one message from CommandDriver, at most one from train driver?
- what higher-priority tasks could be runnable?
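A sketch of the monitor's receive loop, illustrating why a sensor report can queue behind other requests: Receive() hands over waiting senders one at a time, so any request accepted before the report adds to the report's latency. Message tags and the payload layout are assumptions.

```c
int Receive(int *tid, char *msg, int msglen);            /* kernel SRR primitives */
int Reply(int tid, const char *reply, int rplen);

enum { MSG_SENSOR_REPORT, MSG_OTHER_REQUEST };
struct msg { int type; /* payload omitted */ };

void monitor_loop(void) {
    for (;;) {
        int sender;
        struct msg m;
        Receive(&sender, (char *)&m, sizeof(m));
        switch (m.type) {
        case MSG_SENSOR_REPORT:
            /* latency-critical path: record sensor data */
            break;
        case MSG_OTHER_REQUEST:
            /* requests handled before a pending sensor report add to its
               latency, so keep this short */
            break;
        }
        Reply(sender, 0, 0);                             /* unblock the sender promptly */
    }
}
```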