summaryrefslogtreecommitdiffstats
path: root/docs/clamp/acm/controlloop-architecture.rst
blob: 1c09bea0aeb2356f6b5fac375acb98692ad11519 (plain)
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
52
53
54
55
56
57
58
59
60
61
62
63
64
65
66
67
68
69
70
71
72
73
74
75
76
77
78
79
80
81
82
83
84
85
86
87
88
89
90
91
92
93
94
95
96
97
98
99
100
101
102
103
104
105
106
107
108
109
110
111
112
113
114
115
116
117
118
119
120
121
122
123
124
125
126
127
128
129
130
131
132
133
134
135
136
137
138
139
140
141
142
143
144
145
146
147
148
149
150
151
152
153
154
155
156
157
158
159
160
161
162
163
164
165
166
167
168
169
170
171
172
173
174
175
176
177
178
179
180
181
182
183
184
185
186
187
188
189
190
191
192
193
194
195
196
197
198
199
200
201
202
203
204
205
206
207
208
209
210
211
212
213
214
215
216
217
218
219
220
221
222
223
224
225
226
227
228
229
230
231
232
233
234
235
236
237
238
239
240
241
242
243
244
245
246
247
248
249
250
251
252
253
254
255
256
257
258
259
260
261
262
263
264
265
266
267
268
269
270
271
272
273
274
275
276
277
278
279
280
281
282
283
284
285
286
287
288
289
290
291
292
293
294
295
296
297
298
299
300
301
302
303
304
305
306
307
308
309
310
311
312
313
314
315
316
317
318
319
320
321
322
323
324
325
326
327
328
329
330
331
332
333
334
335
336
337
338
339
340
341
342
343
344
345
346
347
348
349
350
351
352
353
354
355
356
357
358
359
360
361
362
363
364
365
366
367
368
369
370
371
372
373
374
375
376
377
378
379
380
381
382
383
384
385
386
387
388
389
390
391
392
393
394
395
396
397
398
399
400
401
402
403
404
405
406
407
408
409
410
411
412
413
414
415
416
417
418
419
420
421
422
423
424
425
426
427
428
429
430
431
432
433
434
435
436
437
438
439
440
441
442
443
444
445
446
447
448
449
450
451
452
453
454
455
456
457
458
459
460
461
462
463
464
465
466
467
468
.. This work is licensed under a Creative Commons Attribution 4.0 International License.

.. _clamp-controlloop_architecture-label:

TOSCA Defined Control Loops: Architecture and Design
####################################################


.. contents::
    :depth: 4

The idea of using control loops to automatically (or autonomously) perform network management
has been the subject of much research in the Network Management research community, see
:download:`this paper <files/ControlLoops.pdf>` for some background. However, it is only with
the advent of ONAP that we have a platform that supports control loops for network management.
Before ONAP, Control Loops have been implemented by hard-coding components together and hard
coding logic into components. ONAP has taken a step forward towards automatic implementation
of Control Loops by allowing parameterization of Control Loops that work on the premise that
the Control Loops use a set of analytic, policy, and control components connected together in
set ways.

The goal of the work is to extend and enhance the current ONAP Control Loop support to provide
a complete open-source framework for Control Loops. This will enhance the current support to
provide TOSCA based Control Loop definition and development, commissioning and run-time management.
The participants that comprise a Control Loop and the metadata needed to link the participants
together to create a Control Loop are specified in a standardized way using the `OASIS TOSCA
modelling language <http://docs.oasis-open.org/tosca/TOSCA-Simple-Profile-YAML/>`_. The TOSCA
description is then used to commission, instantiate, and manage the Control Loops in the run
time system.

.. image:: images/01-controlloop-overview.png

1 Terminology
=============

This section describes the terminology used in the system.

1.1 Control Loop Terminology
----------------------------

**Control Loop Type:** A definition of a Control Loop in the TOSCA language. This definition describes
a certain type of a control loop. The life cycle of instances of a Control Loop Type are managed
by CLAMP.

**Control Loop Instance:** An instance of a Control Loop Type. The life cycle of a Control Loop
Instance is managed by CLAMP. A Control Loop Instance is a set of executing elements on which
Life Cycle Management (LCM) is executed collectively. For example, a set of microservices may be
spawned and executed together to deliver a service. This collection of services is a control loop.

**Control Loop Element Type:** A definition of a Control Loop Element in the TOSCA language. This
definition describes a certain type of Control Loop Element for a control loop in a Control
Loop Type.

**Control Loop Element Instance:** A single entity executing on a participant, with its Life Cycle
being managed as part of the overall control loop. For example, a single microservice that is
executing as one microservice in a service.

**CLAMP Control Loop Runtime:** The CLAMP server that holds Control Loop Type definitions and manages
the life cycle of Control Loop Instances and their Control Loop Elements in cooperation with
participants.


1.2 Participant Terminology
---------------------------

**Participant Type:** Definition of a type of system or framework that can take part in control
loops and a definition of the capabilities of that participant type. A participant advertises
its type to the CLAMP Control Loop Runtime.

**Participant:** A system or framework that takes part in control loops by executing Control Loop
Elements in cooperation with the CLAMP Control Loop Runtime. A participant chooses to partake
in control loops, to manage Control Loop Elements for CLAMP, and to receive, send and act on
LCM messages for the CLAMP runtime.

1.3 Terminology for Properties
------------------------------

**Common Properties:** Properties that apply to all Control Loop Instances of a certain Control
Loop Type and are specified when a Control Loop Type is commissioned.

**Instance Specific Properties:** Properties that must be specified for each Control Loop Instance
and are specified when a Control Loop Instance is Initialized.

1.4 Concepts and their relationships
------------------------------------

The UML diagram below shows the concepts described in the terminology sections above and how
they are interrelated.

.. image:: images/02-controlloop-concepts.png

The Control Loop Definition concepts describe the types of things that are in the system. These
concepts are defined at design time and are passed to the runtime in a TOSCA document. The
concepts in the Control Loop Runtime are created by the runtime part of the system using the
definitions created at design time.

.. _controlloop-capabilities:

2 Capabilities
==============

We consider the capabilities of Control Loops at Design Time and Run Time.

At Design Time, three capabilities are supported:

#. **Control Loop Element Definition Specification.** This capability allows users to define Control
   Loop Element Types and the metadata that can be used on and configured on a Control Loop Element
   Type. Users also define the Participant Type that will run the Control Loop Element when it is
   taking part in in a control loop. The post condition of an execution of this capability is that
   metadata for a Control Loop Element Type is defined in the Control Loop Design Time Catalogue.

#. **Control Loop Element Definition Onboarding.** This capability allows external users and systems
   (such as SDC or DCAE-MOD) to define the metadata that can be used on and configured on a Control
   Loop Element Type and to define the Participant Type that will run the Control Loop Element when
   it is taking part in in a control loop. The post condition of an execution of this capability
   is that metadata for a Control Loop Element Type is defined in the Control Loop Design Time
   Catalogue.

#. **Control Loop Type Definition.** This capability allows users and other systems to create Control
   Loop Type definitions by specifying a set of Control Loop Element Definitions from those that
   are available in the Control Loop Design Time Catalogue. These Control Loop Elements will
   work together to form Control Loops. In an execution of this capability, a user specifies the
   metadata for the Control Loop and specifies the set of Control Loop Elements and their Participant
   Types. The user also selects the correct metadata sets for each participant in the Control Loop
   Type and defines the overall Control Loop Type metadata. The user also specifies the Common
   Property Types that apply to all instances of a control loop type and the Instance Specific
   Property Types that apply to individual instances of a Control Loop Type. The post condition for
   an execution of this capability is a Control Loop definition in TOSCA stored in the Control Loop
   Design Time Catalogue.

.. note::
    Once a Control Loop Definition is commissioned to the Control Loop Runtime and has been
    stored in the Run Time Inventory, it cannot be further edited unless it is decommissioned.


At Run Time, the following participant related capabilities are supported:

#. **System Pre-Configuration.** This capability allows participants to register and deregister
   with CLAMP. Participants explicitly register with CLAMP when they start. Control Loop Priming
   is performed on each participant once it registers. The post condition for an execution of this
   capability is that a participant becomes available (registration) or is no longer available
   (deregistration) for participation in a control loop.

#. **Control Loop Priming on Participants.** A participant is primed to support a Control Loop Type.
   Priming a participant means that the definition of a control loop and the values of Common
   Property Types that apply to all instances of a control loop type on a participant are sent
   to a participant. The participant can then take whatever actions it need to do to support
   the control loop type in question. Control Loop Priming takes place at participant
   registration and at Control Loop Commissioning. The post condition for an execution of this
   capability is that all participants in this control loop type are commissioned, that is they
   are prepared to run instances of their Control Loop Element types.


At Run Time, the following Control Loop Life Cycle management capabilities are supported:

#. **Control Loop Commissioning:** This capability allows version controlled Control Loop Type
   definitions to be taken from the Control Loop Design Time Catalogue and be placed in the
   Commissioned Control Loop Inventory. It also allows the values of Common Property Types
   that apply to all instances of a Control Loop Type to be set. Further, the Control Loop
   Type is primed on all concerned participants. The post condition for an execution of this
   capability is that the Control Loop Type definition is in the Commissioned Control Loop
   Inventory and the Control Loop Type is primed on concerned participants.

#. **Control Loop Instance Life Cycle Management:** This capability allows a Control Loop
   Instance to have its life cycle managed.

   #. **Control Loop Instance Creation:** This capability allows a Control Loop Instance to be
      created. The Control Loop Type definition is read from the Commissioned Control Loop
      Inventory and values are assigned to the Instance Specific Property Types defined for
      instances of the Control Loop Type in the same manner as the existing CLAMP client does.
      A Control Loop Instance that has been created but has not yet been instantiated on
      participants is in state UNINITIALIZED. In this state, the Instance Specific Property Type
      values can be revised and updated as often as the user requires. The post condition for an
      execution of this capability is that the Control Loop instance is created in the
      Instantiated Control Loop Inventory but has not been instantiated on Participants.

   #. **Control Loop Instance Update on Participants:** Once the user is happy with the property
      values, the Control Loop Instance is updated on participants and the Control Loop Elements
      for this Control Loop Instance are initialized or updated by participants using the control
      loop metadata. The post condition for an execution of this capability is that the Control
      Loop instance is updated on Participants.

   #. **Control Loop State Change:** The user can now order the participants to change the state
      of the Control Loop Instance. If the Control Loop is set to state RUNNING, each participant
      begins accepting and processing control loop events and the Control Loop Instance is set
      to state RUNNING in the Instantiated Control Loop inventory. The post condition for an
      execution of this capability is that the Control Loop instance state is changed on
      participants.

   #. **Control Loop Instance Monitoring:** This capability allows Control Loop Instances to be
      monitored. Users can check the status of Participants, Control Loop Instances, and Control
      Loop Elements. Participants report their overall status and the status of Control Loop
      Elements they are running periodically to CLAMP. Clamp aggregates these status reports
      into an aggregated Control Loop Instance status record, which is available for monitoring.
      The post condition for an execution of this capability is that Control Loop Instances are
      being monitored.

   #. **Control Loop Instance Supervision:** This capability allows Control Loop Instances to be
      supervised. The CLAMP runtime expects participants to report on Control Loop Elements
      periodically. The CLAMP runtime checks that periodic reports are received and that each
      Control Loop Element is in the state it should be in. If reports are missed or if a
      Control Loop Element is in an incorrect state, remedial action is taken and notifications
      are issued. The post condition for an execution of this capability is that Control Loop
      Instances are being supervised by the CLAMP runtime.

   #. **Control Loop Instance Removal from Participants:** A user can order the removal of a Control
      Loop Instance from participants. The post condition for an execution of this capability is
      that the Control Loop instance is removed from Participants.

   #. **Control Loop Instance Deletion:** A user can order the removal of a Control Loop Instance
      from the CLAMP runtime. Control Loop Instances that are instantiated on participants cannot
      be removed from the CLAMP runtime. The post condition for an execution of this capability
      is that the Control Loop instance is removed from Instantiated Control Loop Inventory.

#. **Control Loop Decommissioning:** This capability allows version controlled Control Loop Type
   definitions to be removed from the Commissioned Control Loop Inventory. A Control Loop
   Definition that has instances in the Instantiated Control Loop Inventory cannot be removed.
   The post condition for an execution of this capability is that the Control Loop Type
   definition removed from the Commissioned Control Loop Inventory.

.. note::
    The system dialogues for run time capabilities are described in detail on the
    :ref:`System Level Dialogues <system-level-label>` page.

.. _controlloop-instance-states:

2.1 Control Loop Instance States
--------------------------------

When a control loop definition has been commissioned, instances of the control loop can be
created, updated, and deleted. The system manages the lifecycle of control loops and control
loop elements following the state transition diagram below.

.. image:: images/03-controlloop-instance-states.png

3 Overall Target Architecture
=============================

The diagram below shows an overview of the architecture of TOSCA based Control Loop
Management in CLAMP.

.. image:: images/04-overview.png

Following the ONAP Reference Architecture, the architecture has a Design Time part and
a Runtime part.

The Design Time part of the architecture allows a user to specify metadata for participants.
It also allows users to compose control loops. The Design Time Catalogue contains the metadata
primitives and control loop definition primitives for composition of control loops. As shown
in the figure above, the Design Time component provides a system where Control Loops can be
designed and defined in metadata. This means that a Control Loop can have any arbitrary
structure and the Control Loop developers can use whatever analytic, policy, or control
participants they like to implement their Control Loop. At composition time, the user
parameterises the Control Loop and stores it in the design time catalogue. This catalogue
contains the primitive metadata for any participants that can be used to compose a Control
Loop. A Control Loop SDK is used to compose a Control Loop by aggregating the metadata for
the participants chosen to be used in a Control Loop and by constructing the references between
the participants. The architecture of the Control Loop Design Time part will be elaborated in
future releases.

Composed Control Loops are commissioned on the run time part of the system, where they are
stored in the Commissioned Control Loop inventory and are available for instantiation. The
Commissioning component provides a CRUD REST interface for Control Loop Types, and implements
CRUD of Control Loop Types. Commissioning also implements validation and persistence of incoming
Control Loop Types. It also guarantees the integrity of updates and deletions of Control Loop
Types, such as performing updates in accordance with semantic versioning rules and ensuring that
deletions are not allowed on Control Loop Types that have instances defined.

The Instantiation component manages the Life Cycle Management of Control Loop Instances and
their Control Loop Elements. It publishes a REST interface that is used to create Control Loop
Instances and set values for Common and Instance Specific properties. This REST interface is
public and is used by the CLAMP GUI. It may also be used by any other client via the public
REST interface. the REST interface also allows the state of Control Loop Instances to be changed.
A user can change the state of Control Loop Instances as described in the state transition
diagram shown in section 2 above. The Instantiation component issues update and state change
messages via DMaaP to participants so that they can update and manage the state of the Control
Loop Elements they are responsible for. The Instantiation component also implements persistence
of Control Loop Instances, control loop elements, and their state changes.

The Monitoring component reads updates sent by participants. Participants report on the
state of their Control Loop Elements periodically and in response to a message they have
received from the Instantiation component. The Monitoring component reads the contents of
the participant messages and persists their state updates and statistics records. It also
publishes a REST interface that publishes the current state of all Participants, Control
Loop Instances and their Control Loop Elements, as well as publishing Participant and
Control Loop statistics.

The Supervision component is responsible for checking that Control Loop Instances are correctly
instantiated and are in the correct state (UNINITIALIZED/READY/RUNNING). It also handles
timeouts and on state changes to Control Loop Instances, and retries and rolls back state
changes where state changes failed.

A Participant is an executing component that partakes in control loops. More explicitly, a
Participant is something that implements the Participant Instantiation and Participant
Monitoring messaging protocol over DMaaP for Life Cycle management of Control Loop Elements.
A Participant runs Control Loop Elements and manages and reports on their life cycle
following the instructions it gets from the CLAMP runtime in messages delivered over DMaaP.

In the figure above, five participants are shown. A Configuration Persistence Participant
manages Control Loop Elements that interact with the `ONAP Configuration Persistence Service
<https://docs.onap.org/projects/onap-cps/en/latest/overview.html>`_
to store common data. The DCAE Participant runs Control Loop Elements that manage DCAE
microservices. The Kubernetes Participant hosts the Control Loop Elements that are managing
the life cycle of microservices in control loops that are in a Kubernetes ecosystem. The
Policy Participant handles the Control Loop Elements that interact with the Policy Framework
to manage policies for control loops. A Controller Participant such as the CDS Participant
runs Control Loop Elements that load metadata and configure controllers so that they can
partake in control loops. Any third party Existing System Participant can be developed to
run Control Loop Elements that interact with any existing system (such as an operator's
analytic, machine learning, or artificial intelligence system) so that those systems can
partake in control loops.

4. Other Considerations
=======================

.. _management-cl-instance-configs:

4.1 Management of Control Loop Instance Configurations
------------------------------------------------------

In order to keep management of versions of the configuration of control loop instances
straightforward and easy to implement, the following version management scheme using
semantic versioning is implemented. Each configuration of a Control Loop Instance and
configuration of a Control Loop Element has a semantic version with 3 digits indicating
the **major.minor.patch** number of the version.

.. note::
    A **configuration** means a full set of parameter values for a Control Loop Instance.

.. image:: images/05-upgrade-states.png

Change constraints:

#. A Control Loop or Control Loop Element in state **RUNNING** can be changed to a higher patch
   level or rolled back to a lower patch level. This means that hot changes that do not
   impact the structure of a Control Loop or its elements can be executed.

#. A Control Loop or Control Loop Element in state **PASSIVE** can be changed to a higher
   minor/patch level or rolled back to a lower minor/patch level. This means that structural
   changes to Control Loop Elements that do not impact the Control Loop as a whole can be
   executed by taking the control loop to state **PASSIVE**.

#. A Control Loop or Control Loop Element in state **UNINITIALIZED** can be changed to a higher
   major/minor/patch level or rolled back to a lower major/minor/patch level. This means
   that where the structure of the entire control loop is changed, the control loop must
   be uninitialized and reinitialized.

#. If a Control Loop Element has a **minor** version change, then its Control Loop Instance
   must have at least a **minor** version change.

#. If a Control Loop Element has a **major** version change, then its Control Loop Instance
   must have a **major** version change.

4.2 Scalability
---------------

The system is designed to be inherently scalable. The CLAMP runtime is stateless, all state
is preserved in the Instantiated Control Loop inventory in the database. When the user
requests an operation such as an instantiation, activation, passivation, or an uninitialization
on a Control Loop Instance, the CLAMP runtime broadcasts the request to participants over
DMaaP and saves details of the request to the database. The CLAMP runtime does not directly
wait for responses to requests.

When a request is broadcast on DMaaP, the request is asynchronously picked up by participants
of the types required for the Control Loop Instance and those participants manage the life
cycle of its control loop elements. Periodically, each participant reports back on the status
of operations it has picked up for the Control Loop Elements it controls, together with
statistics on the Control Loop Elements over DMaaP. On reception of these participant messages,
the CLAMP runtime stores this information to its database.

The participant to use on a control loop can be selected from the registered participants
in either of two ways:

**Runtime-side Selection:** The CLAMP runtime selects a suitable participant from the list of
participants and sends the participant ID that should be used in the Participant Update message.
In this case, the CLAMP runtime decides on which participant will run the Control Loop Element
based on a suitable algorithm. Algorithms could be round robin based or load based.

**Participant-side Selection:** The CLAMP runtime sends a list of Participant IDs that may be used
in the Participant Update message. In this case, the candidate participants decide among
themselves which participant should host the Control Loop Element.

This approach makes it easy to scale Control Loop life cycle management. As Control Loop
Instance counts increase, more than one CLAMP runtime can be deployed and REST/supervision
operations on Control Loop Instances can run in parallel. The number of participants can
scale because an asynchronous broadcast mechanism is used for runtime-participant communication
and there is no direct connection or communication channel between participants and CLAMP
runtime servers. Participant state, Control Loop Instance state, and Control Loop Element
state is held in the database, so any CLAMP runtime server can handle operations for any
participant. Because many participants of a particular type can be deployed and participant
instances can load balance control loop element instances for different Control Loop Instances
of many types across themselves using a mechanism such as a Kubernetes cluster.


4.3 Sandboxing and API Gateway Support
--------------------------------------

At runtime, interaction between ONAP platform services and application microservices are
relatively unconstrained, so interactions between Control Loop Elements for a given Control
Loop Instance remain relatively unconstrained. A
`proposal to support access-controlled access to and between ONAP services
<https://wiki.onap.org/pages/viewpage.action?pageId=103417456>`_
will improve this. This can be complemented by intercepting and controlling services
accesses between Control Loop Elements for Control Loop Instances for some/all Control
Loop types.

API gateways such as `Kong <https://konghq.com/kong/>`_ have emerged as a useful technology
for exposing and controlling service endpoint access for applications and services. When a
Control Loop Type is onboarded, or when Control Loop Instances are created in the Participants,
CLAMP can configure service endpoints between Control Loop Elements to redirect through an
API Gateway.

Authentication and access-control rules can then be dynamically configured at the API gateway
to support constrained access between Control Loop Elements and Control Loop Instances.

The diagram below shows the approach for configuring API Gateway access at Control Loop
Instance and Control Loop Element level.

.. image:: images/06-api-gateway-sandbox.png

At design time, the Control Loop type definition specifies the type of API gateway configuration
that should be supported at Control Loop and Control Loop Element levels.

At runtime, the CLAMP can configure the API gateway to enable (or deny) interactions between
Control Loop Instances and individually for each Control Loop Element. All service-level
interactions in/out of a Control Loop Element, except that to/from the API Gateway, can be
blocked by networking policies, thus sandboxing a Control Loop Element and an entire Control
Loop Instance if desired. Therefore, a Control Loop Element will only have access to the APIs
that are configured and enabled for the Control Loop Element/Instance in the API gateway.

For some Control Loop Element Types the Participant can assist with service endpoint
reconfiguration, service request/response redirection to/from the API Gateway, or
annotation of requests/responses.

Once the Control Loop instance is instantiated on participants, the participants configure
the API gateway with the Control Loop Instance level configuration and with the specific
configuration for their Control Loop Element.

Monitoring and logging of the use of the API gateway may also be provided. Information and
statistics on API gateway use can be read from the API gateway and passed back in monitoring
messages to the CLAMP runtime.

Additional isolation and execution-environment sandboxing can be supported depending on the
Control Loop Element Type. For example: ONAP policies for given Control Loop Instances/Types
can be executed in a dedicated PDP engine instances; DCAE or K8S-hosted services can executed
in isolated namespaces or in dedicated workers/clusters; etc..


5 APIs and Protocols
====================

The APIs and Protocols used by CLAMP for Control Loops are described on the pages below:

#. :ref:`System Level Dialogues <system-level-label>`
#. :ref:`The CLAMP Control Loop Participant Protocol <controlloop-participant-protocol-label>`
#. :ref:`REST APIs for CLAMP Control Loops <controlloop-rest-apis-label>`


6 Design and Implementation
===========================

The design and implementation of TOSCA Control Loops in CLAMP is described for each executable entity on the pages below:

#. :ref:`The CLAMP Control Loop Runtime Server <clamp-runtime-acm>`
#. :ref:`CLAMP Control Loop Participants <clamp-controlloop-participants>`
#. :ref:`Managing Control Loops using The CLAMP GUI <clamp-gui-controlloop>`

End of Document