Documentation: migrate "Critical Section Monitor" from wiki

links: https://cwiki.apache.org/confluence/display/NUTTX/Critical+Section+Monitor
2023-11-04 21:59:24 +01:00 · 2023-11-04 21:59:24 +01:00 · fdc671fa3e
parent f96064db75
commit fdc671fa3e
1 changed files with 269 additions and 0 deletions
--- a/Documentation/implementation/critical_sections.rst
+++ b/Documentation/implementation/critical_sections.rst
@ -2,6 +2,40 @@
 Critical Sections
 =================
 Types and Effects of Critical Sections
 ======================================
 A critical section is a short sequence of code where exclusive execution is
 assured by globally disabling other activities while that code sequence executes.
 When we discuss critical sections here we really refer to one of two mechanisms:
 * **Critical Section proper** A critical section is established by calling
  ``enter_critical_section()``; the code sequence exits the critical section by
  calling ``leave_critical_section()``. For the single CPU case, this amounts to
  simply disabling interrupts but is more complex in the SMP case where spinlocks
  are also involved.
 * **Disabling Pre-emption** This is a related mechanism that is lumped into this
  discussion because of the similarity of its effects on the system. When pre-emption
  is disabled (via ``sched_lock()``), interrupts remain enabled, but context switches
  may not occur; the current task is locked in place and cannot be suspended until
  the scheduler is unlocked (via ``sched_unlock()``).
 The use of either mechanism will always harm real-time performance.
 The effects of critical sections on real-time performance is discussed in
 `Effects of Disabling Interrupts or Pre-Emption on Response Latency <https://cwiki.apache.org/confluence/display/NUTTX/Effects+of+Disabling+Interrupts+or+Pre-Emption+on+Response+Latency>`_ [TODO: move to documentation].
 The end result is that a certain amount of **jitter** is added to the real-time response.
 Critical sections cannot be avoided within the OS and, as a consequence, a certain
 amount of "jitter" in the response time is expected. The important thing is to monitor
 the maximum time that critical sections are in place in order to manage that jitter so
 that the variability in response time is within an acceptable range.
 NOTE: This discussion applies to Normal interrupt processing. Most of this discussion
 does not apply to :doc:`/guides/zerolatencyinterrupts`. Those interrupts are not masked
 in the same fashion and none of the issues address in this page apply to those
 interrupts.
 Single CPU Critical Sections
 ============================
@ -161,3 +195,238 @@ the single CPU case. Here are the caveats:
  themselves at any time (say, via ``sleep()``). In that case, only the CPU's
  IDLE task will be permitted to run.
 The Critical Section Monitor
 ============================
 Internal OS Hooks
 -----------------
 **The Critical Section Monitor**
 In order to measure the time that tasks hold critical sections, the OS supports
 a Critical Section Monitor. This is internal instrumentation that records the
 time that a task holds a critical section. It also records the amount of time
 that interrupts are disabled globally. The Critical Section Monitor then retains
 the maximum time that the critical section is in place, both per-task and globally.
 The Critical Section Monitor is enabled with the following setting in the
 configuration::
  CONFIG_SCHED_CRITMONITOR=y
 **Perf Timers interface**
 .. todo:: missing description for perf_xxx interface
 **Per Thread and Global Critical Sections**
 In NuttX critical sections are controlled on a per-task basis. For example,
 consider the following code sequence:
 .. code-block:: C
   irqstate_t flags = enter_critical_section();
   sleep(5);
   leave_critical_section(flags);
 The task, say Task A, establishes the critical section with
 ``enter_critical_section()``, but when Task A is suspended by the ``sleep(5)``
 statement, it relinquishes the critical section. The state of the system will
 then be determined by the next task to be resumed, say Task B: Typically, the
 next task will not be in a critical section and so the critical section is
 broken while the task sleeps. That critical section will be re-established when
 that Task A runs again after the sleep time expires.
 However, if Task B that is resumed is also within a critical section, then the
 critical section will be extended even longer! This is why the global time that
 the critical section in place may be longer than any time that an individual
 thread holds the critical section.
 ProcFS
 ------
 The OS reports these maximum times via the ProcFS file system, typically
 mounted at ``/proc``:
 * The ``/proc/<ID>/critmon`` pseudo-file reports the per-thread maximum value
  for thread ID = <ID>. There is one instance of this critmon file for each
  active task in the system.
 * The ``/proc/critmon`` pseuo-file reports similar information for the global
  state of the CPU.
 The form of the output from the ``/proc/<ID>/critmon`` file is::
  X.XXXXXXXXX,X.XXXXXXXXX
 Where ``X.XXXXXXXXX`` is the time in seconds with nanosecond precision
 (but not necessarily accuracy, accuracy is dependent on the timing clock
 source). The first number is the maximum time that the held pre-emption
 disabled; the second number number is the longest duration that the critical
 section was held.
 This file cat be read from NSH like:
 .. code-block:: bash
   nsh> cat /proc/1/critmon
   0.000009610,0.000001165
 The form of the output from the ``/proc/critmon`` file is simlar::
  X,X.XXXXXXXXX,X.XXXXXXXXX
 Where the first X is the CPU number and the following two numbers have the
 same interpretation as for ``/proc/<ID>/critmon``. In the single CPU case,
 there will be one line in the pseudo-file with ``X=0``; in the SMP case
 there will be multiple lines, one for each CPU.
 This file can also be read from NSH:
 .. code-block:: bash
   nsh> cat /proc/critmon
   0,0.000009902,0.000023590
 These statistics are cleared each time that the pseudo-file is read so that
 the reported values are the maximum since the last time that the ProcFS pseudo
 file was read.
 ``apps/system/critmon``
 -----------------------
 Also available is a application daemon at ``apps/sysem/critmon``. This daemon
 periodically reads the ProcFS files described above and dumps the output to
 stdout. This daemon is enabled with:
 .. code-block:: bash
   nsh> critmon_start
   Csection Monitor: Started: 3
   Csection Monitor: Running: 3
   nsh>
   PRE-EMPTION CSECTION    PID   DESCRIPTION
   MAX DISABLE MAX TIME
   0.000100767 0.000005242  ---  CPU 0
   0.000000292 0.000023590     0 Idle Task
   0.000036696 0.000004078     1 init
   0.000000000 0.000014562     3 Csection Monitor
   ...
 And can be stopped with:
 .. code-block:: bash
   nsh> critmon_stop
   Csection Monitor: Stopping: 3
   Csection Monitor: Stopped: 3
 IRQ Monitor and Worst Case Response Time
 ========================================
 The IRQ Monitor is additional OS instrumentation. A full discusssion of the
 IRQ Monitor is beyond the scope of this page. Suffice it to say:
 * The IRQ Monitor is enabled with ``CONFIG_SCHED_IRQMONITOR=y``.
 * The data collected by the IRQ Monitor is provided in ``/proc/irqs``.
 * This data can also be viewed using the ``nsh> irqinfo`` command.
 * This data includes the number of interrupts received for each IRQ and the
  time required to process the interrupt, from entry into the attached
  interrupt handler until exit from the interrupt handler.
 From this information we can calculate the worst case response time from
 interrupt request until a task runs that can process the the interrupt.
 That worst cast response time, ``Tresp``, is given by:
 * ``Tresp1 = Tcrit + Tintr + C1``
 * ``Tresp2 = Tintr + Tpreempt + C2``
 * ``Tresp = MAX(Tresp1, Tresp2)``
 Where:
 * ``C1`` and ``C2`` are unknown, irreducible constants that reflect such things as
  hardware interrupt latency and context switching time,
 * ``Tcrit`` is the longest observed time within a critical section,
 * ``Tintr`` is the time required for interrupt handler execution for the event
  of interest, and
 * ``Tpreempt`` is the longest observed time with preemption disabled.
 NOTES:
 #. This calculation assumes that the task of interest is the highest priority task
   in the system. It does not consider the possibility of the responding task being
   delayed due to insufficient priority.
 #. This calculation does not address the case where the interfering task has both
   preemption disabled and holds the critical section. Certainly Tresp1 is valid
   in this case, but Tresp2 is not. There might some additional, unmeasured delay
   after the interrupt and before the responding task can run depending on the order
   in which the critical section is released and preemption is re-enabled:
     * When the task leaves the critical section, the pending interrupt will execute
       immediately with or without preemption enabled.
     * If preemption is enabled first, then the will be no delay after the interrupt
       because preemption will be enabled when the interrupt returns.
     * If the task leaves critical section first, then there will be some small delay
       of unknown duration after the interrupts returns and before the responding
       task can run because preemption will be disabled when the interrupt returns.
 #. This calculation does not address concurrent interrupts. All interrupts run at the
   same priority and if an interrupt request occurs while within an interrupt handler,
   then it must pend until completion of that interrupt. So perhaps the above formula
   for ``Tresp1`` should instead be the following? (This assumes that hardware arbitration
   is such that the interrupt of interest will be deferred by no more than one interrupt).
   Concurrent, nested interrupts might be better supported with prioritized.
   See more: :doc:`/guides/nestedinterrupts`.
     * ``Tresp1 = Tcrit + Tintrmax + Tintr + C1``
       Where:
       * ``Tintrmax`` is the longest interrupt processing time of all interrupt sources
         (excluding the interrupt for the event under consideration).
 What can you do?
 ----------------
 What can you do if the timing data indicates that you cannot meet your deadline?
 You have these options:
 #. Use these tools to find the exact function that holds the critical section or
   disables preemption too long. Then optimize that function so that it releases
   that resource sooner. Often critical sections are established over long sequences
   or code when they could be re-designed to use critical sections over shorter code
   sequences.
 #. In some cases, use of critical sections or disabling of pre-emption could replaced
   with a locking semaphore. The scope of the locking effect for the use of such locks
   is not global but is limited only to tasks that share the same resource. Critical
   sections should correctly be used only to protect resources that are shared between
   tasking level logic and interrupt level logic.
 #. Switch to :doc:`/guides/zerolatencyinterrupts`. Those interrupts are not subject
   to most of the issues discussed in this page.
 **NOTE**
 There are a few places in the OS were preemption is disabled via ``sched_lock()`` in
 order to establish a critical section. That is an incorrect use of ``sched_lock()``.
 ``sched_lock()`` simply prevents the currently executing task from being suspended.
 For the case of the single CPU platform, that does effectively create a critical
 section: Since no other task can run, the locking task does have exclusive access
 to all resources that are not shared with interrupt level logic.
 But in the multi-CPU SMP case that is not true. ``sched_lock()`` still keeps the
 current task running on CPU from being suspended, but it does not support any
 exclusivity in accesses because there will be other tasks running on other CPUs
 that may access the same resources.