VLIB (Vector Processing Library)
================================

The files associated with vlib are located in the ./src/{vlib,
vlibapi, vlibmemory} folders. These libraries provide vector
processing support including graph-node scheduling, reliable multicast
support, ultra-lightweight cooperative multi-tasking threads, a CLI,
plug-in .DLL support, physical memory, and Linux epoll support. Parts
of this library embody US Patent 7,961,636.

Init function discovery
-----------------------

vlib applications register for various [initialization] events by
placing structures and \__attribute__((constructor)) functions into
the image. At appropriate times, the vlib framework walks
constructor-generated singly-linked structure lists, performs a
topological sort based on specified constraints, and calls the
indicated functions. Vlib applications create graph nodes, add CLI
functions, start cooperative multi-tasking threads, etc. etc. using
this mechanism.

vlib applications invariably include a number of
VLIB_INIT_FUNCTION (my_init_function) macros.

Each init / configure / etc. function has the return type
clib_error_t \*. Make sure that the function returns 0 if all is well,
otherwise the framework will announce an error and exit.

vlib applications must link against vppinfra, and often link against
other libraries such as VNET. In the latter case, it may be necessary
to explicitly reference symbol(s), otherwise large portions of the
library may be AWOL at runtime.

Init function construction and constraint specification
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~

It's easy to add an init function:

.. code:: c

   static clib_error_t *
   my_init_function (vlib_main_t *vm)
   {
     /* ... initialize things ... */
     return 0; // or return clib_error_return (0, "BROKEN!");
   }

   VLIB_INIT_FUNCTION (my_init_function);

As given, my_init_function will be executed "at some point," but with
no ordering guarantees.

Specifying ordering constraints is easy:

.. code:: c

   VLIB_INIT_FUNCTION (my_init_function) =
   {
     .runs_before = VLIB_INITS ("we_run_before_function_1",
                                "we_run_before_function_2"),
     .runs_after = VLIB_INITS ("we_run_after_function_1",
                               "we_run_after_function_2"),
   };

It's also easy to specify bulk ordering constraints of the form "a
then b then c then d":

.. code:: c

   VLIB_INIT_FUNCTION (my_init_function) =
   {
     .init_order = VLIB_INITS ("a", "b", "c", "d"),
   };

It's OK to specify all three sorts of ordering constraints for a
single init function, although it's hard to imagine why it would be
necessary.

Node Graph Initialization
-------------------------

vlib packet-processing applications invariably define a set of graph
nodes to process packets.

One constructs a vlib_node_registration_t, most often via the
VLIB_REGISTER_NODE macro. At runtime, the framework processes the set
of such registrations into a directed graph. It is easy enough to add
nodes to the graph at runtime. The framework does not support removing
nodes.

vlib provides several types of vector-processing graph nodes,
primarily to control framework dispatch behaviors. The type member of
the vlib_node_registration_t functions as follows (a registration
sketch follows the list):

-  VLIB_NODE_TYPE_PRE_INPUT - run before all other node types
-  VLIB_NODE_TYPE_INPUT - run as often as possible, after pre_input
   nodes
-  VLIB_NODE_TYPE_INTERNAL - only when explicitly made runnable by
   adding pending frames for processing
-  VLIB_NODE_TYPE_PROCESS - only when explicitly made runnable.
   "Process" nodes are actually cooperative multi-tasking threads. They
   **must** explicitly suspend after a reasonably short period of time.
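Putting these pieces together, a minimal registration sketch for a
hypothetical internal node follows; the node name "my-node" and the
dispatch function my_node_fn are illustrative placeholders, not vlib
symbols:

.. code:: c

   static uword
   my_node_fn (vlib_main_t *vm, vlib_node_runtime_t *node,
               vlib_frame_t *frame)
   {
     /* ... process the frame, enqueue buffers to next nodes ... */
     return frame->n_vectors;
   }

   VLIB_REGISTER_NODE (my_node) =
   {
     .name = "my-node",
     .function = my_node_fn,
     .type = VLIB_NODE_TYPE_INTERNAL, /* runs only when frames are pending */
     .vector_size = sizeof (u32),     /* frames carry u32 buffer indices */
     .n_next_nodes = 1,
     .next_nodes = {
       [0] = "error-drop",            /* next-node arc 0, by name */
     },
   };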
For a precise understanding of the graph node dispatcher, please read
./src/vlib/main.c:vlib_main_loop.

Graph node dispatcher
---------------------

Vlib_main_loop() dispatches graph nodes. The basic vector processing
algorithm is diabolically simple, but may not be obvious from even a
long stare at the code. Here's how it works: some input node, or set
of input nodes, produce a vector of work to process. The graph node
dispatcher pushes the work vector through the directed graph,
subdividing it as needed, until the original work vector has been
completely processed. At that point, the process recurs.

This scheme yields a stable equilibrium in frame size, by
construction. Here's why: as the frame size increases, the
per-frame-element processing time decreases. There are several related
forces at work; the simplest to describe is the effect of vector
processing on the CPU L1 I-cache. The first frame element [packet]
processed by a given node warms up the node dispatch function in the
L1 I-cache. All subsequent frame elements profit. As we increase the
number of frame elements, the cost per element goes down.

Under light load, it is a crazy waste of CPU cycles to run the graph
node dispatcher flat-out. So, the graph node dispatcher arranges to
wait for work by sitting in a timed epoll wait if the prevailing frame
size is low. The scheme has a certain amount of hysteresis to avoid
constantly toggling back and forth between dispatch and wait modes.

Process events
--------------

The vlib process event mechanism is extremely lightweight. Signals of
the form (event_type, event_data) are delivered to a specific process
node. Process nodes can suspend for fixed amounts of time, until
another entity signals an event, or both. A process node's dispatch
function is typically a while(1) loop which waits for an event or a
timeout.
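The canonical shape of such an event loop is sketched below. This is a
sketch, not vlib source: the event types (EVENT1, EVENT2) and the
handler functions are application-defined placeholders.

.. code:: c

   /* Application-defined event types (placeholders) */
   enum { EVENT1 = 1, EVENT2 };

   /* Application-defined handlers (placeholders) */
   static void handle_event1 (uword event_data);
   static void handle_event2 (uword event_data);

   static uword
   example_process (vlib_main_t *vm, vlib_node_runtime_t *rt,
                    vlib_frame_t *f)
   {
     uword event_type, *event_data = 0;
     int i;

     while (1)
       {
         /* Suspend until an event arrives, or one second elapses */
         (void) vlib_process_wait_for_event_or_clock (vm, 1.0 /* seconds */);

         event_type = vlib_process_get_events (vm, &event_data);
         switch (event_type)
           {
           case ~0: /* no events => timeout */
             break;

           case EVENT1:
             /* Handle every element, never just event_data[0] */
             for (i = 0; i < vec_len (event_data); i++)
               handle_event1 (event_data[i]);
             break;

           case EVENT2:
             for (i = 0; i < vec_len (event_data); i++)
               handle_event2 (event_data[i]);
             break;

           default: /* This should never happen... */
             clib_warning ("BUG: unhandled event type %d", event_type);
             break;
           }
         /* Reset the vector length; do not free it (see below) */
         vec_reset_length (event_data);
       }
     return 0; /* NOTREACHED */
   }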
When an event arrives, the code demuxes on the event type, calling the
appropriate handler function. Each call to vlib_process_get_events
returns a vector of per-event-type data passed to successive
vlib_process_signal_event calls; it is a serious error to process only
event_data[0].

Resetting the event_data vector-length to 0 [instead of calling
vec_free] means that the event scheme doesn't burn cycles continuously
allocating and freeing the event data vector. This is a common
vppinfra / vlib coding pattern, well worth using when appropriate.

Signaling an event is easy, for example:

.. code:: c

   vlib_process_signal_event (vm, process_node_index, EVENT1,
                              (uword) arbitrary_event1_data); /* and so forth */

One can know the process node index by construction - dig it out of
the appropriate vlib_node_registration_t - or find the vlib_node_t
with vlib_get_node_by_name(…).

Buffers
-------

vlib buffering solves the usual set of packet-processing problems,
albeit at high performance. Key in terms of performance: one
ordinarily allocates / frees N buffers at a time rather than one at a
time. Except when operating directly on a specific buffer, one deals
with buffers by index, not by pointer. Packet-processing frames are
u32[] arrays, not vlib_buffer_t[] arrays. (See the sketch at the end
of this section.)

Packets comprise one or more vlib buffers, chained together as
required. Multiple particle sizes are supported; hardware input nodes
simply ask for the required size(s). Coalescing support is available.
For obvious reasons one is discouraged from writing one's own wild and
wacky buffer chain traversal code.

vlib buffer headers are allocated immediately prior to the buffer data
area. In typical packet processing this saves a dependent read wait:
given a buffer's address, one can prefetch the buffer header
[metadata] at the same time as the first cache line of buffer data.

Buffer header metadata (vlib_buffer_t) includes the usual rewrite
expansion space, a current_data offset, RX and TX interface indices,
packet trace information, and opaque areas. The opaque data is
intended to control packet processing in arbitrary subgraph-dependent
ways. The programmer shoulders responsibility for data lifetime
analysis, type-checking, etc.

Buffers have reference-counts in support of e.g. multicast
replication.
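A minimal sketch of the batch-allocation and index-to-pointer patterns
described above; the function name, batch size, and payload handling
are illustrative only:

.. code:: c

   static void
   buffer_batch_example (vlib_main_t *vm)
   {
     u32 buffer_indices[32];
     vlib_buffer_t *b;
     u32 n_alloc, i;

     /* Allocate N buffers at a time; may return fewer than requested */
     n_alloc = vlib_buffer_alloc (vm, buffer_indices, 32);

     for (i = 0; i < n_alloc; i++)
       {
         /* Translate index to pointer only when touching the buffer */
         b = vlib_get_buffer (vm, buffer_indices[i]);

         /* Payload begins at vlib_buffer_get_current (b) */
         clib_memset (vlib_buffer_get_current (b), 0, 64);
         b->current_length = 64;
       }

     /* Free the whole batch by index, not one buffer at a time */
     vlib_buffer_free (vm, buffer_indices, n_alloc);
   }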
Shared-memory message API
-------------------------

Local control-plane and application processes interact with the vpp
dataplane via asynchronous message-passing in shared memory over
unidirectional queues. The same application APIs are available via
sockets.

Capturing API traces and replaying them in a simulation environment
requires a disciplined approach to the problem. This seems like a
make-work task, but it is not. When something goes wrong in the
control-plane after 300,000 or 3,000,000 operations, high-speed replay
of the events leading up to the accident is a huge win.

The shared-memory message API message allocator vl_api_msg_alloc uses
a particularly cute trick. Since messages are processed in order, we
try to allocate message buffering from a set of fixed-size,
preallocated rings. Each ring item has a "busy" bit. Freeing one of
the preallocated message buffers merely requires the message consumer
to clear the busy bit. No locking required.

Debug CLI
---------

Adding debug CLI commands to VLIB applications is very simple. Here is
a complete example:

.. code:: c

   static clib_error_t *
   show_ip_tuple_match (vlib_main_t *vm,
                        unformat_input_t *input, vlib_cli_command_t *cmd)
   {
     vlib_cli_output (vm, "%U\n", format_ip_tuple_match_tables,
                      &routing_main);
     return 0;
   }

   VLIB_CLI_COMMAND (show_ip_tuple_command, static) =
   {
     .path = "show ip tuple match",
     .short_help = "Show ip 5-tuple match-and-broadcast tables",
     .function = show_ip_tuple_match,
   };

This example implements the "show ip tuple match" debug cli command.
In ordinary usage, the vlib cli is available via the "vppctl"
application, which sends traffic to a named pipe. One can configure
debug CLI telnet access on a configurable port.

The cli implementation has an output redirection facility which makes
it simple to deliver cli output via shared-memory API messaging.
Particularly for debug or "show tech support" type commands, it would
be wasteful to write vlib application code to pack binary data, write
more code elsewhere to unpack the data, and finally print the answer.

If a certain cli command has the potential to hurt packet processing
performance by running for too long, do the work incrementally in a
process node. The client can wait.
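Real commands usually parse their arguments with the unformat
machinery before producing output. A sketch, assuming a hypothetical
"set widget" command (none of these names exist in vlib):

.. code:: c

   static clib_error_t *
   set_widget_command_fn (vlib_main_t *vm, unformat_input_t *input,
                          vlib_cli_command_t *cmd)
   {
     u32 value = 0;
     int enable = 0;

     while (unformat_check_input (input) != UNFORMAT_END_OF_INPUT)
       {
         if (unformat (input, "value %u", &value))
           ;
         else if (unformat (input, "enable"))
           enable = 1;
         else
           return clib_error_return (0, "unknown input '%U'",
                                     format_unformat_error, input);
       }

     vlib_cli_output (vm, "widget: value %u, %s", value,
                      enable ? "enabled" : "disabled");
     return 0;
   }

   VLIB_CLI_COMMAND (set_widget_command, static) =
   {
     .path = "set widget",
     .short_help = "set widget value <nn> [enable]",
     .function = set_widget_command_fn,
   };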
Macro expansion
~~~~~~~~~~~~~~~

The vpp debug CLI engine includes a recursive macro expander. This is
quite useful for factoring out address and/or interface name
specifics:

::

   define ip1 192.168.1.1/24
   define ip2 192.168.2.1/24
   define iface1 GigabitEthernet3/0/0
   define iface2 loop1

   set int ip address $iface1 $ip1
   set int ip address $iface2 $(ip2)

   undefine ip1
   undefine ip2
   undefine iface1
   undefine iface2

Each socket (or telnet) debug CLI session has its own macro tables.
All debug CLI sessions which use CLI_INBAND binary API messages share
a single table.

The macro expander recognizes circular definitions:

::

   define foo $(bar)
   define bar $(mumble)
   define mumble $(foo)

At 8 levels of recursion, the macro expander throws up its hands and
replies "CIRCULAR."

Macro-related debug CLI commands
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~

In addition to the "define" and "undefine" debug CLI commands, use
"show macro [noevaluate]" to dump the macro table. The "echo" debug
CLI command will evaluate and print its argument:

::

   vpp# define foo This\ Is\ Foo
   vpp# echo $foo
   This Is Foo

Handing off buffers between threads
-----------------------------------

Vlib includes an easy-to-use mechanism for handing off buffers between
worker threads. A typical use-case: software ingress flow hashing. At
a high level, one creates a per-worker-thread queue which sends
packets to a specific graph node in the indicated worker thread. With
the queue in hand, enqueue packets to the worker thread of your
choice.

Initialize a handoff queue
~~~~~~~~~~~~~~~~~~~~~~~~~~

Simple enough, call vlib_frame_queue_main_init:

.. code:: c

   main_ptr->frame_queue_index =
     vlib_frame_queue_main_init (dest_node.index, frame_queue_size);

frame_queue_size means what it says: the number of frames which may be
queued. Since frames contain 1…256 packets, frame_queue_size should be
a reasonably small number (32…64). If the frame queue producer(s) are
faster than the frame queue consumer(s), congestion will occur.
Suggest letting the enqueue operator deal with queue congestion, as
shown in the enqueue example below.

Under the floorboards, vlib_frame_queue_main_init creates an input
queue for each worker thread. Please do NOT create frame queues until
it's clear that they will be used. Although the main dispatch loop is
reasonably smart about how often it polls the (entire set of) frame
queues, polling unused frame queues is a waste of clock cycles.

Hand off packets
~~~~~~~~~~~~~~~~

The actual handoff mechanics are simple, and integrate nicely with a
typical graph-node dispatch function:

.. code:: c

   always_inline uword
   do_handoff_inline (vlib_main_t *vm,
                      vlib_node_runtime_t *node, vlib_frame_t *frame,
                      int is_ip4, int is_trace)
   {
     u32 n_left_from, *from;
     vlib_buffer_t *bufs[VLIB_FRAME_SIZE], **b;
     u16 thread_indices[VLIB_FRAME_SIZE];
     u16 nexts[VLIB_FRAME_SIZE], *next;
     u32 n_enq;
     htest_main_t *hmp = &htest_main;
     int i;

     from = vlib_frame_vector_args (frame);
     n_left_from = frame->n_vectors;
     vlib_get_buffers (vm, from, bufs, n_left_from);

     next = nexts;
     b = bufs;

     /*
      * Typical frame traversal loop, details vary with
      * use case. Make sure to set thread_indices[i] with
      * the desired destination thread index. You may
      * or may not bother to set next[i].
      */

     for (i = 0; i < frame->n_vectors; i++)
       {
         /* Pick a thread to handle this packet */
         thread_indices[i] = f (packet_data_or_whatever);

         b += 1;
         next += 1;
         n_left_from -= 1;
       }

     /* Enqueue buffers to threads */
     n_enq = vlib_buffer_enqueue_to_thread (vm, node,
                                            hmp->frame_queue_index,
                                            from, thread_indices,
                                            frame->n_vectors,
                                            1 /* drop on congestion */);

     /* Typical counters */
     if (n_enq < frame->n_vectors)
       vlib_node_increment_counter (vm, node->node_index,
                                    XXX_ERROR_CONGESTION_DROP,
                                    frame->n_vectors - n_enq);
     vlib_node_increment_counter (vm, node->node_index,
                                  XXX_ERROR_HANDED_OFF, n_enq);
     return frame->n_vectors;
   }

Notes about calling vlib_buffer_enqueue_to_thread(…):

-  If you pass "drop on congestion" non-zero, all packets in the
   inbound frame will be consumed one way or the other. This is the
   recommended setting.
-  In the drop-on-congestion case, please don't try to "help" in the
   enqueue node by freeing dropped packets, or by pushing them to
   "error-drop." Either of those actions would be a severe error.
-  It's perfectly OK to enqueue packets to the current thread.

Handoff Demo Plugin
-------------------

Check out the sample (plugin) example in …/src/examples/handoffdemo.
If you want to build the handoff demo plugin:

::

   $ cd .../src/plugins
   $ ln -s ../examples/handoffdemo

This plugin provides a simple example of how to hand off packets
between threads. We used it to debug packet-tracer handoff tracing
support.

Packet generator input script
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~

::

   packet-generator new {
       name x
       limit 5
       size 128-128
       interface local0
       node handoffdemo-1
       data {
           incrementing 30
       }
   }

Start vpp with 2 worker threads
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~

The demo plugin hands packets from worker 1 to worker 2.
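How vpp acquires its worker threads is deployment-specific; one way to
arrange two workers is via the cpu section of startup.conf. A sketch
(the core numbers are illustrative):

::

   cpu {
     main-core 0
     workers 2
   }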
Enable tracing, and start the packet generator
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~

::

   trace add pg-input 100
   packet-generator enable

Sample Run
~~~~~~~~~~

::

   DBGvpp# ex /tmp/pg_input_script
   DBGvpp# pa en
   DBGvpp# sh err
    Count                    Node                  Reason
         5             handoffdemo-1              packets handed off processed
         5             handoffdemo-2              completed packets
   DBGvpp# show run
   Thread 1 vpp_wk_0 (lcore 0)
   Time 133.9, average vectors/node 5.00, last 128 main loops 0.00 per node 0.00
     vector rates in 3.7331e-2, out 0.0000e0, drop 0.0000e0, punt 0.0000e0
                Name                 State         Calls          Vectors        Suspends         Clocks       Vectors/Call
   handoffdemo-1                    active                  1               5               0          4.76e3            5.00
   pg-input                        disabled                 2               5               0          5.58e4            2.50
   unix-epoll-input                 polling             22760               0               0          2.14e7            0.00
   ---------------
   Thread 2 vpp_wk_1 (lcore 2)
   Time 133.9, average vectors/node 5.00, last 128 main loops 0.00 per node 0.00
     vector rates in 0.0000e0, out 0.0000e0, drop 3.7331e-2, punt 0.0000e0
                Name                 State         Calls          Vectors        Suspends         Clocks       Vectors/Call
   drop                             active                  1               5               0          1.35e4            5.00
   error-drop                       active                  1               5               0          2.52e4            5.00
   handoffdemo-2                    active                  1               5               0          2.56e4            5.00
   unix-epoll-input                 polling             22406               0               0          2.18e7            0.00

Enable the packet tracer and run it again…

::

   DBGvpp# trace add pg-input 100
   DBGvpp# pa en
   DBGvpp# sh trace

   ------------------- Start of thread 0 vpp_main -------------------
   No packets in trace buffer
   ------------------- Start of thread 1 vpp_wk_0 -------------------
   Packet 1

   00:06:50:520688: pg-input
     stream x, 128 bytes, 0 sw_if_index
     current data 0, length 128, buffer-pool 0, ref-count 1, trace handle 0x1000000
     00000000: 000102030405060708090a0b0c0d0e0f101112131415161718191a1b1c1d0000
     00000020: 0000000000000000000000000000000000000000000000000000000000000000
     00000040: 0000000000000000000000000000000000000000000000000000000000000000
     00000060: 0000000000000000000000000000000000000000000000000000000000000000
   00:06:50:520762: handoffdemo-1
     HANDOFFDEMO: current thread 1

   Packet 2

   00:06:50:520688: pg-input
     stream x, 128 bytes, 0 sw_if_index
     current data 0, length 128, buffer-pool 0, ref-count 1, trace handle 0x1000001
     00000000: 000102030405060708090a0b0c0d0e0f101112131415161718191a1b1c1d0000
     00000020: 0000000000000000000000000000000000000000000000000000000000000000
     00000040: 0000000000000000000000000000000000000000000000000000000000000000
     00000060: 0000000000000000000000000000000000000000000000000000000000000000
   00:06:50:520762: handoffdemo-1
     HANDOFFDEMO: current thread 1

   Packet 3

   00:06:50:520688: pg-input
     stream x, 128 bytes, 0 sw_if_index
     current data 0, length 128, buffer-pool 0, ref-count 1, trace handle 0x1000002
     00000000: 000102030405060708090a0b0c0d0e0f101112131415161718191a1b1c1d0000
     00000020: 0000000000000000000000000000000000000000000000000000000000000000
     00000040: 0000000000000000000000000000000000000000000000000000000000000000
     00000060: 0000000000000000000000000000000000000000000000000000000000000000
   00:06:50:520762: handoffdemo-1
     HANDOFFDEMO: current thread 1

   Packet 4

   00:06:50:520688: pg-input
     stream x, 128 bytes, 0 sw_if_index
     current data 0, length 128, buffer-pool 0, ref-count 1, trace handle 0x1000003
     00000000: 000102030405060708090a0b0c0d0e0f101112131415161718191a1b1c1d0000
     00000020: 0000000000000000000000000000000000000000000000000000000000000000
     00000040: 0000000000000000000000000000000000000000000000000000000000000000
     00000060: 0000000000000000000000000000000000000000000000000000000000000000
   00:06:50:520762: handoffdemo-1
     HANDOFFDEMO: current thread 1

   Packet 5

   00:06:50:520688: pg-input
     stream x, 128 bytes, 0 sw_if_index
     current data 0, length 128, buffer-pool 0, ref-count 1, trace handle 0x1000004
     00000000: 000102030405060708090a0b0c0d0e0f101112131415161718191a1b1c1d0000
     00000020: 0000000000000000000000000000000000000000000000000000000000000000
     00000040: 0000000000000000000000000000000000000000000000000000000000000000
     00000060: 0000000000000000000000000000000000000000000000000000000000000000
   00:06:50:520762: handoffdemo-1
     HANDOFFDEMO: current thread 1

   ------------------- Start of thread 2 vpp_wk_1 -------------------
   Packet 1

   00:06:50:520796: handoff_trace
     HANDED-OFF: from thread 1 trace index 0
   00:06:50:520796: handoffdemo-2
     HANDOFFDEMO: current thread 2
   00:06:50:520867: error-drop
     rx:local0
   00:06:50:520914: drop
     handoffdemo-2: completed packets

   Packet 2

   00:06:50:520796: handoff_trace
     HANDED-OFF: from thread 1 trace index 1
   00:06:50:520796: handoffdemo-2
     HANDOFFDEMO: current thread 2
   00:06:50:520867: error-drop
     rx:local0
   00:06:50:520914: drop
     handoffdemo-2: completed packets

   Packet 3

   00:06:50:520796: handoff_trace
     HANDED-OFF: from thread 1 trace index 2
   00:06:50:520796: handoffdemo-2
     HANDOFFDEMO: current thread 2
   00:06:50:520867: error-drop
     rx:local0
   00:06:50:520914: drop
     handoffdemo-2: completed packets

   Packet 4

   00:06:50:520796: handoff_trace
     HANDED-OFF: from thread 1 trace index 3
   00:06:50:520796: handoffdemo-2
     HANDOFFDEMO: current thread 2
   00:06:50:520867: error-drop
     rx:local0
   00:06:50:520914: drop
     handoffdemo-2: completed packets

   Packet 5

   00:06:50:520796: handoff_trace
     HANDED-OFF: from thread 1 trace index 4
   00:06:50:520796: handoffdemo-2
     HANDOFFDEMO: current thread 2
   00:06:50:520867: error-drop
     rx:local0
   00:06:50:520914: drop
     handoffdemo-2: completed packets

   DBGvpp#