Age | Commit message (Collapse) | Author | Files | Lines |
|
To improve reliability of healthchecks in CSITs, they are implemented
in docker-compose. This commit moves SDNC and dependencies into main
docker-compose and adds healthchecks to them. This improves the CSIT
in that tests will quickly fail if SDNC or PNFsim containers fail to
start (currently tests run anyway even if containers are not ready).
- Move SDNC, MariaDB and PNFSim containers to main docker-compose
- Add healthchecks and timeouts for those containers
- Move node mounting script as a sidecar container so it is protected
by healthchecks
Issue-ID: CPS-2632
Signed-off-by: danielhanrahan <daniel.hanrahan@est.tech>
Change-Id: Ib53522c2f756d3ce2c6d6b7472cb4c65359fe355
|
|
- increase the healthcheck retries from 3 to 10
Issue-ID: CPS-2679
Change-Id: Idd541fd88eb804d552659c32f6c6c92a0919c405
Signed-off-by: halil.cakal <halil.cakal@est.tech>
|
|
Issue-ID: CPS-2668
Change-Id: Ie7f1b2dcaa9883a40bc3e0e9c447b44ddf681f59
Signed-off-by: leventecsanyi <levente.csanyi@est.tech>
|
|
- change gauge name from cmHandlesByState to cps_ncmp_inventory_cm_handles_by_state
Issue-id: CPS-2662
Change-Id: I97a036b155b638cd6ed94a739e3dffb7b6f00172
Signed-off-by: emaclee <lee.anjella.macabuhay@est.tech>
|
|
- configure restart policy as on-failure and set it to restart atmost 3
times and then give up.
- prevents infinite restart
- gives an opportunity to the app to heal on its own
Issue-ID: CPS-2647
Change-Id: I6cd0c8601060ec518df1f10bcd0dac1b97f622d8
Signed-off-by: mpriyank <priyank.maheshwari@est.tech>
|
|
|
|
cps-and-ncmp
Issue-ID: CPS-2643
Change-Id: I57ba18dcaa71f4c2c5d985e9c870569e40d26880
Signed-off-by: halil.cakal <halil.cakal@est.tech>
|
|
- Bumped dmi version used by docker
Issue-ID: CPS-2504
Change-Id: I7b61aa692432ae79fc26b1c93f7f8a9981d9bcdd
Signed-off-by: seanbeirne <sean.beirne@est.tech>
|
|
To improve reliability and consistency of healthchecks in
k6 and CSIT tests, they are implemented in docker-compose.
This commit adds fail-fast logic, where k6 and CSITs will
abort immediately if the containers won't start.
Implementation:
- Add healthchecks for docker containers used in tests.
- Change k6 & CSIT tests to use docker healthchecks.
- Tests will abort if containers are not healthy.
- Start-up timeout for CPS containers is 90 seconds
- Start-up timeout for other containers is 60 seconds
Other Improvements:
- Add --quiet-pull option to suppress junk output in Jenkins logs.
- Add kpi.env file containing environment variables for KPI pipeline,
just like endurance.env. This allows same code to run either suite.
- Changed from port range to port number for Postgres exporter, since
only a single instance runs, to be consistent with other containers
Issue-ID: CPS-2630
Signed-off-by: danielhanrahan <daniel.hanrahan@est.tech>
Change-Id: I50929ca4061bb844fca87a0d6c3103aaa2c45e0b
|
|
- if swap memory is active on any host, the docker-compose will add it
on top of JVM process memory. This will exceed the total memory
reservation of the container, eventually cause restart.
- swap limit can not be lower than the total memory limit, thus given
3G. this will disallow docker-compose to use swap space
Issue-ID: CPS-2628
Change-Id: I01bd4a1fd3f4e894fe8236bbc38cd7718fb0b6fb
Signed-off-by: halil.cakal <halil.cakal@est.tech>
|
|
|
|
- The profile 'monitoring' prevents postgres-exporter service to run for
both kpi and endurance pipelines. As a result, the postgres dashboard
does not work on Nordix Grafana.
Issue-ID: CPS-2585
Change-Id: I272ab609c60094bb347ef405787679860cc43595
Signed-off-by: halil.cakal <halil.cakal@est.tech>
|
|
- hikari dashboard will insight into db connection pool such as
connection count e.g. min, max, and idle.
- change coloring for some panel from JVM dashboard
- add tags to the dashboards
Issue-ID: CPS-2592
Change-Id: Ibfb49ab5c1e338d04a7c8f18a413b5db7b8d5ad7
Signed-off-by: halil.cakal <halil.cakal@est.tech>
|
|
|
|
|
|
Issue-ID: CPS-2616
Signed-off-by: danielhanrahan <daniel.hanrahan@est.tech>
Change-Id: Ie179c6e16cb9c49944b64665aba89c85029f4268
|
|
- if kpi and endurance suits runing together at the same time,
instance flag will help to filter either of them. otherwise the
stats for both suits will be visible on a single plot together.
Issue-ID: CPS-2585
Change-Id: I422adb0afd3a618c477989a9396636f63808c275
Signed-off-by: halil.cakal <halil.cakal@est.tech>
|
|
- Replaced KafkaContainer with ConfluentKafkaContainer to fix the
deprecation issue
- Bumped the version of cp-kafka and cp-zookeeper to 7.8.0
- Removing reference of registry.nordix as the NCMP internal pipeline is no
longer used
Issue-ID: CPS-2608
Change-Id: Ic1f3edbc06d63c5f00c1ecb901e07a6bc17ae6eb
Signed-off-by: mpriyank <priyank.maheshwari@est.tech>
|
|
- add postgres-exporter service to expose postgresql db metrics to prometheus
- add new job scraper to prometheus for both kpi and endurance jobs
- add new dashboard: postgresql-statistics to the grafana
Issue-ID: CPS-2585
Change-Id: Ia7ede41d00364918b1054d9b5a38e2feea4fc8e5
Signed-off-by: halil.cakal <halil.cakal@est.tech>
|
|
|
|
Issue-ID: CPS-2583
Signed-off-by: danielhanrahan <daniel.hanrahan@est.tech>
Change-Id: I9c7a543201a63c1f128737bb0995843af6075a6b
|
|
- data dashboard for NCMP Data operations
(NetworkCmProxyController.java)
Issue-ID: CPS-2537
Change-Id: Ic518e9169057691702d0d760468e114f5a9fbd7d
Signed-off-by: halil.cakal <halil.cakal@est.tech>
|
|
|
|
- Increase CM-handles to 50000 in k6 tests
- Increase container memory to 3GB (crashes with 2GB)
- Decrease max heap percentage to 70% to ensure container has
at least 200MB free memory, to prevent JVM from crashing at
higher peak loads.
Issue-ID: CPS-2584
Signed-off-by: danielhanrahan <daniel.hanrahan@est.tech>
Change-Id: I8daabff58723114bdc3838b379e83c3ffbbb0ba9
|
|
- rename lcm-state dashboard to inventory
- remove panels holding a single state of cm handle
- add panel for inventory APIs calls by URI and the STATUS
Issue-ID: CPS-2567
Change-Id: I374fa18b53d456519142cf059f74b2a070485355
Signed-off-by: halil.cakal <halil.cakal@est.tech>
|
|
Heap percent was commented out by mistake leading to OutOfMemoryError
in endurance pipeline, due to only 25% of memory being used for heap.
Issue-ID: CPS-2570
Signed-off-by: danielhanrahan <daniel.hanrahan@est.tech>
Change-Id: Ifbffe7222e38f017105cfac8bc8307255d28d11a
|
|
|
|
- adding a grafana dashboard for lcm state changes of
cm handles
Issue-ID: CPS-2558
Change-Id: Ie00884736ffa5c08e1be150c71cf8ac603f2ab60
Signed-off-by: halil.cakal <halil.cakal@est.tech>
|
|
- Registration: create and upgrade cases.
- Handle moduleSetTag deletion (all orphans) for testware
- Unit tests updated
- additional logging of details for upgrade scenarios
- Integration Tests updated
- Remove cache for module sets being processed
- Removed DbCleaner (startup)
- Removed redundant methods in NCMP Inventory for deleting schema set(s)
- Removed validation check for all schema set interactions
- Updated some schema set tests to use special characters previously not allowed
- Checked integration test scenarios for upgrades with and without tags: all scenarios covered!
TODO
- REST endpoint to remove orphaned schema set data, separate story: CPS-2554
- Investigate exception handling regarding DuplicateYangResourceException: CPS-2555
Issue-ID: CPS-2540
Signed-off-by: ToineSiebelink <toine.siebelink@est.tech>
Change-Id: Iaa59cbdb86b7a4a8044624829bc002506ff40cc7
|
|
- change image of grafana to grafana/grafana from grafana/grafana-oss
in local docker-compose file
Issue-ID: CPS-2559
Change-Id: I03bf895c10a108c9aeae2db6b2dc6b39db505df5
Signed-off-by: halil.cakal <halil.cakal@est.tech>
|
|
- add healthcheck to ensure prometheus service is running correctly and
restarts it if it becomes unhealthy
- set the data retention time to 30d
- update readme
Issue-ID: CPS-2466
Change-Id: I0f6186f8577e8e0113831ed040298b823e33d624
Signed-off-by: halil.cakal <halil.cakal@est.tech>
|
|
- assing port range to cps-and-ncmp service to plot metrics
separately
Issue-ID: CPS-2536
Change-Id: Ic097017bbcca25f28341e8e77f658a0a4e8964d7
Signed-off-by: halil.cakal <halil.cakal@est.tech>
|
|
- Reverted DMI_VERSION to latest Ncmp-dmi-Plugin 1.6.0-SNAPSHOT-latest
for CSIT.
- Corrected dmi-stub docker image name from "dmi-plugin-demo-and-csit-stub" to "dmi-stub"
Issue-ID: CPS-2499
Change-Id: I99087b7230a40e958b3b789549a4a8e4c9dd0341
Signed-off-by: sourabh_sourabh <sourabh.sourabh@est.tech>
|
|
- add new config file containing env variables for endurance test
- add notes to describe how to run and stop it
Note 1: Existing docker-compose deployment won't be affected by this
changes
Note 2: Both docker-compose can be deployed simultaneously without any
issue
Issue-ID: CPS-2463
Change-Id: I15936718dda2f31a5077b9c36eded21c1a85dfcd
Signed-off-by: halil.cakal <halil.cakal@est.tech>
|
|
Issue-ID: CPS-2403
Signed-off-by: danielhanrahan <daniel.hanrahan@est.tech>
Change-Id: I614313461bd36ca351a8d42d9683b88460090b50
|
|
|
|
- remove hard-coded container names from load balancer(nginx) and
prometheus
Issue-ID: CPS-2437
Change-Id: Ia64b949c2eb3e4d67e123b046e66d5bfdda2c801
Signed-off-by: halil.cakal <halil.cakal@est.tech>
|
|
- limit the total memory with 2G
- set heap memory 75% of the total RAM
Issue-ID: CPS-2180
Change-Id: Ia8a6c2bd40866052caba4f332ac563e1ed829212
Signed-off-by: halil.cakal <halil.cakal@est.tech>
|
|
(fighting between IntelliJ and Checkstyle best practices)
- non-2xx responses are processed using web client exceptions
- handle unknown host exception
- upgraded spotbugs (checkstyle and related mvn plugin)
- fixed some small spotbugs due to upgrade
- added commented instructions in docker compose to enable debugging
- added some environment variables for policy executor configuration
- extract out Sleeper in stub service to achieve 100% coverage
- added cause to Policy Executor exceptions where applicable
- ignored (new) spotbug rule about catch NPE because of issue in 3pp
- ignored (new) spotbug rule about \n in string due to multiline string block
Issue-ID: CPS-2412
Change-Id: I6835a73320c436cbeea12cc7a06f15899eec7bf1
Signed-off-by: ToineSiebelink <toine.siebelink@est.tech>
|
|
By default, nginx uses round robin for load balancing. Round robin
works well when all requests take same amount of time, but is not
optimal for dynamic loads.
With least-connected load balancing, nginx will try not to overload
a busy server with excessive requests, sending requests to a less
busy server instead.
Issue-ID: CPS-2415
Signed-off-by: danielhanrahan <daniel.hanrahan@est.tech>
Change-Id: I0e7d6b36a79aa6541f2f5f667746f2792d55e54b
|
|
- Remove servers from yaml.
- Add prefix to path in yaml.
- Remove basepath prefix from Controller Impl.
- Added policy server port to 8093.
- Exposed docker env. vars for policy services.
Issue-ID: CPS-2291
Change-Id: Idcc9e23f9d63bad2480537dc8d39523fe82b4b83
Signed-off-by: ToineSiebelink <toine.siebelink@est.tech>
Signed-off-by: sourabh_sourabh <sourabh.sourabh@est.tech>
|
|
- Overrides initial processing delay of every new module set tag for DMI plugin stub from 2 min (default) to 0 as a temp. solution.
Issue-ID: CPS-2395
Change-Id: Ic875262314406ec55457e31a69c00d55293c1b74
Signed-off-by: sourabh_sourabh <sourabh.sourabh@est.tech>
|
|
- Specify both cps-and-ncmp instances in Prometheus config
- Add default JVM Micrometer dashboard to Grafana, taken from
https://grafana.com/grafana/dashboards/4701-jvm-micrometer/
Issue-ID: CPS-2314
Signed-off-by: danielhanrahan <daniel.hanrahan@est.tech>
Change-Id: I24a42b8b4771a7d9341e920fbd85fa72e1a3518e
|
|
- Applied a filter on tasks.scheduled.execution.
- It is configurable from application.ymal into cps.scheduled-task-names: - tasks.scheduled.execution by default.
- Added a new property configuration bean for open telemetry.
- Introduced new propery attribute that takes list of task names to be
filtered.
Issue-ID: CPS-2250
Change-Id: I8e5334cd166343181df6dd1985471f5465f14f0e
Signed-off-by: sourabh_sourabh <sourabh.sourabh@est.tech>
|
|
- add kafka support to k6 codebase
- add two new tests: data operation batch passthrough read
and data operation async batch read (from kafka)
- calculate the events/second via Trend
- add kafka ui support to docker-compose
Note: Before this commit being merged, the k6 executable
should have been compiled with the kafka extension of xk6-kafka
Issue-ID: CPS-2268
Change-Id: Ib7777b7bc9f15b210ea36d3541cba0e0c943f883
Signed-off-by: halil.cakal <halil.cakal@est.tech>
|
|
- set dmi delay to 670 ms for passthrough write (CUD)
- set dmi delay to 300 ms for passthrough read
Issue-ID:CPS-2285
Change-Id: I7a4294d3ed04c89aeb5988a9975516135317af12
Signed-off-by: halil.cakal <halil.cakal@est.tech>
|
|
Issue-ID: CPS-2038
Change-Id: Iba8950db4f3fcfc13f32bf915189f5fcb2ccee90
Signed-off-by: halil.cakal <halil.cakal@est.tech>
|
|
|
|
- Generate interface from OpenApi in RTD docs
- Fixed Content Type in OpenAPI
- Fixed Paths in OpenAPI
- Made Authorization header compulsory in OpenAPI
- All 'enum' values in OpenAPI lowercase (in linr with CPS/NCMP conventions)
- Added impl with some basic functionality
- Added testware
- Pom includes docker image creation
- Docker compose updated to deploy stub
Issue-ID: CPS-2301
Change-Id: I462ad5c70474b2813fc04005c0d20a1b15b574ec
Signed-off-by: ToineSiebelink <toine.siebelink@est.tech>
Signed-off-by: sourabh_sourabh <sourabh.sourabh@est.tech>
|
|
- this change for two test suits: CSIT and K6
- increase cps-and-ncmp replicas to 2
- introduce nginx as load-balancer
- set proxies inside nginx for the cps-and-ncmp URI paths
- upgrade hazelcast-spring to 5.3.7
Issue-ID: CPS-2038
Change-Id: If1b6c1e1218af16b24cc061507ea9b25d277d143
Signed-off-by: halil.cakal <halil.cakal@est.tech>
|