--Task-- name: Monitoring_start enabled: True class_name: MonitoringStart source_name: controller source_namespace: >default< target_name: controller target_namespace: >default< start: 0.0 stop: None timeout: no timeout loop: False interval: None dependencies: [] wait_for: [] preceding_task: None options: {} group_name: None Current dir: /mnt/disks/data/xslou/lodestar-fork/pyrock ________________________________________________________________________________ [2024-04-21 22:10:59] Monitoring_start pre : Initialising task parameters ________________________________________________________________________________ task will be executed on controller (localhost) ________________________________________________________________________________ [2024-04-21 22:10:59] Monitoring_start step1 : N/A ________________________________________________________________________________ 2024-04-21 22:11:59,339 INFO 2024-04-21 22:11:59,339 INFO **************************************** Start Lodemon **************************************** 2024-04-21 22:11:59,474 INFO 2024-04-21 22:11:59,474 INFO [loop_until]: kubectl --namespace=xlou --context=gke_engineeringpit_us-east1-d_xlou-cdm delete configmap lodemon-config 2024-04-21 22:11:59,474 INFO [loop_until]: (max_time=180, interval=5, expected_rc=[0, 1] 2024-04-21 22:11:59,906 INFO [loop_until]: OK (rc = 0) 2024-04-21 22:11:59,906 DEBUG --- stdout --- 2024-04-21 22:11:59,906 DEBUG configmap "lodemon-config" deleted 2024-04-21 22:11:59,906 DEBUG --- stderr --- 2024-04-21 22:11:59,906 DEBUG 2024-04-21 22:11:59,907 INFO 2024-04-21 22:11:59,907 INFO [loop_until]: kubectl --namespace=xlou --context=gke_engineeringpit_us-east1-d_xlou-cdm create configmap lodemon-config --from-file=/mnt/disks/data/xslou/lodestar-fork/config/config.yaml 2024-04-21 22:11:59,907 INFO [loop_until]: (max_time=180, interval=5, expected_rc=[0] 2024-04-21 22:12:00,233 INFO [loop_until]: OK (rc = 0) 2024-04-21 22:12:00,233 DEBUG --- stdout --- 2024-04-21 22:12:00,233 DEBUG configmap/lodemon-config created 2024-04-21 22:12:00,233 DEBUG --- stderr --- 2024-04-21 22:12:00,233 DEBUG 2024-04-21 22:12:00,241 INFO 2024-04-21 22:12:00,241 INFO [loop_until]: kubectl --namespace=xlou --context=gke_engineeringpit_us-east1-d_xlou-cdm delete configmap lodemon-deployments 2024-04-21 22:12:00,241 INFO [loop_until]: (max_time=180, interval=5, expected_rc=[0, 1] 2024-04-21 22:12:00,667 INFO [loop_until]: OK (rc = 0) 2024-04-21 22:12:00,668 DEBUG --- stdout --- 2024-04-21 22:12:00,668 DEBUG configmap "lodemon-deployments" deleted 2024-04-21 22:12:00,668 DEBUG --- stderr --- 2024-04-21 22:12:00,668 DEBUG 2024-04-21 22:12:00,668 INFO 2024-04-21 22:12:00,668 INFO [loop_until]: kubectl --namespace=xlou --context=gke_engineeringpit_us-east1-d_xlou-cdm create configmap lodemon-deployments --from-file=/mnt/disks/data/xslou/lodestar-fork/config/deployments_to_monitor.yaml 2024-04-21 22:12:00,669 INFO [loop_until]: (max_time=180, interval=5, expected_rc=[0] 2024-04-21 22:12:00,974 INFO [loop_until]: OK (rc = 0) 2024-04-21 22:12:00,974 DEBUG --- stdout --- 2024-04-21 22:12:00,974 DEBUG configmap/lodemon-deployments created 2024-04-21 22:12:00,974 DEBUG --- stderr --- 2024-04-21 22:12:00,974 DEBUG 2024-04-21 22:12:00,978 INFO 2024-04-21 22:12:00,978 INFO ------------- Deploy the lodestarbox with lodemon profile ------------- 2024-04-21 22:12:00,979 INFO 2024-04-21 22:12:00,979 INFO [run_command]: skaffold deploy --profile lodemon --config=/tmp/tmp80c_b3dh --status-check=true --namespace=xlou 2024-04-21 22:12:04,408 INFO Starting deploy... 2024-04-21 22:12:06,215 INFO - serviceaccount/k8s-svc-acct-lodemon unchanged 2024-04-21 22:12:06,316 INFO - clusterrolebinding.rbac.authorization.k8s.io/k8s-svc-acct-crb-xlou unchanged 2024-04-21 22:12:06,517 INFO - deployment.apps/lodemon configured 2024-04-21 22:12:06,617 INFO Waiting for deployments to stabilize... 2024-04-21 22:12:11,739 INFO - xlou:deployment/lodemon: waiting for rollout to finish: 1 old replicas are pending termination... 2024-04-21 22:12:37,225 INFO - xlou:deployment/lodemon is ready. 2024-04-21 22:12:37,326 INFO Deployments stabilized in 30.719 seconds 2024-04-21 22:12:37,427 ERROR There is a new version (2.11.0) of Skaffold available. Download it from: 2024-04-21 22:12:37,427 ERROR https://github.com/GoogleContainerTools/skaffold/releases/tag/v2.11.0 2024-04-21 22:12:37,427 ERROR 2024-04-21 22:12:37,427 ERROR Help improve Skaffold with our 2-minute anonymous survey: run 'skaffold survey' 2024-04-21 22:12:37,427 ERROR To help improve the quality of this product, we collect anonymized usage data for details on what is tracked and how we use this data visit . This data is handled in accordance with our privacy policy 2024-04-21 22:12:37,427 ERROR 2024-04-21 22:12:37,427 ERROR You may choose to opt out of this collection by running the following command: 2024-04-21 22:12:37,427 ERROR skaffold config set --global collect-metrics false 2024-04-21 22:12:37,427 INFO 2024-04-21 22:12:37,427 INFO --------------------- Get expected number of pods --------------------- 2024-04-21 22:12:37,428 INFO 2024-04-21 22:12:37,428 INFO [loop_until]: kubectl --namespace=xlou --context=gke_engineeringpit_us-east1-d_xlou-cdm get deployments --selector app=lodemon --output jsonpath={.items[*].spec.replicas} 2024-04-21 22:12:37,428 INFO [loop_until]: (max_time=180, interval=5, expected_rc=[0] 2024-04-21 22:12:37,749 INFO [loop_until]: OK (rc = 0) 2024-04-21 22:12:37,749 DEBUG --- stdout --- 2024-04-21 22:12:37,749 DEBUG 1 2024-04-21 22:12:37,749 DEBUG --- stderr --- 2024-04-21 22:12:37,749 DEBUG 2024-04-21 22:12:37,750 INFO 2024-04-21 22:12:37,750 INFO ---------------------------- Get pod list ---------------------------- 2024-04-21 22:12:37,750 INFO 2024-04-21 22:12:37,750 INFO [loop_until]: awk -F" " "{print NF}" <<< `kubectl --namespace=xlou --context=gke_engineeringpit_us-east1-d_xlou-cdm get pods --selector app=lodemon --output jsonpath={.items[*].metadata.name}` | grep 1 2024-04-21 22:12:37,750 INFO [loop_until]: (max_time=180, interval=10, expected_rc=[0] 2024-04-21 22:12:38,131 INFO [loop_until]: Function succeeded after 0s (rc=0) - failed to find expected number of element: 1 - retry 2024-04-21 22:12:48,525 INFO [loop_until]: Function succeeded after 10s (rc=0) - failed to find expected number of element: 1 - retry 2024-04-21 22:12:58,994 INFO [loop_until]: Function succeeded after 21s (rc=0) - failed to find expected number of element: 1 - retry 2024-04-21 22:13:09,352 INFO [loop_until]: Function succeeded after 31s (rc=0) - expected number of elements found 2024-04-21 22:13:09,352 INFO [loop_until]: OK (rc = 0) 2024-04-21 22:13:09,352 DEBUG --- stdout --- 2024-04-21 22:13:09,352 DEBUG lodemon-7f68c498d9-gbc5j 2024-04-21 22:13:09,352 DEBUG --- stderr --- 2024-04-21 22:13:09,352 DEBUG 2024-04-21 22:13:09,353 INFO 2024-04-21 22:13:09,353 INFO ------------ Check pod lodemon-7f68c498d9-gbc5j is running ------------ 2024-04-21 22:13:09,353 INFO 2024-04-21 22:13:09,353 INFO [loop_until]: kubectl --namespace=xlou --context=gke_engineeringpit_us-east1-d_xlou-cdm get pods lodemon-7f68c498d9-gbc5j -o=jsonpath={.status.phase} | grep "Running" 2024-04-21 22:13:09,353 INFO [loop_until]: (max_time=360, interval=5, expected_rc=[0] 2024-04-21 22:13:09,699 INFO [loop_until]: Function succeeded after 0s (rc=0) - expected pattern found 2024-04-21 22:13:09,700 INFO [loop_until]: OK (rc = 0) 2024-04-21 22:13:09,700 DEBUG --- stdout --- 2024-04-21 22:13:09,701 DEBUG Running 2024-04-21 22:13:09,701 DEBUG --- stderr --- 2024-04-21 22:13:09,701 DEBUG 2024-04-21 22:13:09,701 INFO 2024-04-21 22:13:09,701 INFO [loop_until]: kubectl --namespace=xlou --context=gke_engineeringpit_us-east1-d_xlou-cdm get pods lodemon-7f68c498d9-gbc5j -o=jsonpath={.status.containerStatuses[*].ready} | grep "true" 2024-04-21 22:13:09,701 INFO [loop_until]: (max_time=360, interval=5, expected_rc=[0] 2024-04-21 22:13:10,021 INFO [loop_until]: Function succeeded after 0s (rc=0) - expected pattern found 2024-04-21 22:13:10,021 INFO [loop_until]: OK (rc = 0) 2024-04-21 22:13:10,021 DEBUG --- stdout --- 2024-04-21 22:13:10,022 DEBUG true 2024-04-21 22:13:10,022 DEBUG --- stderr --- 2024-04-21 22:13:10,022 DEBUG 2024-04-21 22:13:10,022 INFO 2024-04-21 22:13:10,022 INFO [loop_until]: kubectl --namespace=xlou --context=gke_engineeringpit_us-east1-d_xlou-cdm get pod lodemon-7f68c498d9-gbc5j --output jsonpath={.status.startTime} 2024-04-21 22:13:10,022 INFO [loop_until]: (max_time=180, interval=5, expected_rc=[0] 2024-04-21 22:13:10,344 INFO [loop_until]: OK (rc = 0) 2024-04-21 22:13:10,344 DEBUG --- stdout --- 2024-04-21 22:13:10,344 DEBUG 2024-04-21T22:12:06Z 2024-04-21 22:13:10,344 DEBUG --- stderr --- 2024-04-21 22:13:10,344 DEBUG 2024-04-21 22:13:10,345 INFO 2024-04-21 22:13:10,345 INFO ----- Check pod lodemon-7f68c498d9-gbc5j filesystem is accessible ----- 2024-04-21 22:13:11,132 INFO [loop_until]: Function succeeded after 0s (rc=0) - expected pattern found 2024-04-21 22:13:11,132 INFO 2024-04-21 22:13:11,132 INFO ---------- Check pod lodemon-7f68c498d9-gbc5j restart count ---------- 2024-04-21 22:13:11,132 INFO 2024-04-21 22:13:11,132 INFO [loop_until]: kubectl --namespace=xlou --context=gke_engineeringpit_us-east1-d_xlou-cdm get pod lodemon-7f68c498d9-gbc5j --output jsonpath={.status.containerStatuses[*].restartCount} 2024-04-21 22:13:11,132 INFO [loop_until]: (max_time=180, interval=5, expected_rc=[0] 2024-04-21 22:13:11,451 INFO [loop_until]: OK (rc = 0) 2024-04-21 22:13:11,451 DEBUG --- stdout --- 2024-04-21 22:13:11,451 DEBUG 0 2024-04-21 22:13:11,451 DEBUG --- stderr --- 2024-04-21 22:13:11,451 DEBUG 2024-04-21 22:13:11,451 INFO Pod lodemon-7f68c498d9-gbc5j has been restarted 0 times. 2024-04-21 22:13:11,451 INFO 2024-04-21 22:13:11,451 INFO --------------------- Get expected number of pods --------------------- 2024-04-21 22:13:11,452 INFO 2024-04-21 22:13:11,452 INFO [loop_until]: kubectl --namespace=xlou --context=gke_engineeringpit_us-east1-d_xlou-cdm get deployments --selector app=lodemon --output jsonpath={.items[*].spec.replicas} 2024-04-21 22:13:11,452 INFO [loop_until]: (max_time=180, interval=5, expected_rc=[0] 2024-04-21 22:13:11,778 INFO [loop_until]: OK (rc = 0) 2024-04-21 22:13:11,778 DEBUG --- stdout --- 2024-04-21 22:13:11,778 DEBUG 1 2024-04-21 22:13:11,778 DEBUG --- stderr --- 2024-04-21 22:13:11,778 DEBUG 2024-04-21 22:13:11,778 INFO 2024-04-21 22:13:11,778 INFO -------------- Waiting for 1 expected pod(s) to be ready -------------- 2024-04-21 22:13:11,779 INFO 2024-04-21 22:13:11,779 INFO [loop_until]: kubectl --namespace=xlou --context=gke_engineeringpit_us-east1-d_xlou-cdm get deployments lodemon --output jsonpath="ready:{.status.readyReplicas} replicas:{.status.replicas}" | grep "ready:1 replicas:1" 2024-04-21 22:13:11,779 INFO [loop_until]: (max_time=900, interval=30, expected_rc=[0] 2024-04-21 22:13:12,121 INFO [loop_until]: Function succeeded after 0s (rc=0) - expected pattern found 2024-04-21 22:13:12,121 INFO [loop_until]: OK (rc = 0) 2024-04-21 22:13:12,121 DEBUG --- stdout --- 2024-04-21 22:13:12,121 DEBUG ready:1 replicas:1 2024-04-21 22:13:12,121 DEBUG --- stderr --- 2024-04-21 22:13:12,121 DEBUG 2024-04-21 22:13:12,122 INFO Component lodemon is alive 2024-04-21 22:13:13,031 INFO Dumping pod description and logs to /mnt/disks/data/xslou/lodestar-fork/results/pyrock/authn_rest/pod-logs/stack/20240421_221312-after-lodemon-deploy/lodemon-7f68c498d9-gbc5j.txt 2024-04-21 22:13:13,032 INFO Dumping pod description and logs to /mnt/disks/data/xslou/lodestar-fork/results/pyrock/authn_rest/pod-logs/stack/20240421_221312-after-lodemon-deploy/lodemon-7f68c498d9-gbc5j.txt 2024-04-21 22:13:13,032 INFO Check pod logs for errors 2024-04-21 22:13:13,033 INFO 2024-04-21 22:13:13,033 INFO [loop_until]: kubectl --namespace=xlou --context=gke_engineeringpit_us-east1-d_xlou-cdm exec lodemon-7f68c498d9-gbc5j -- curl --fail --silent --show-error http://localhost:8080/monitoring/start 2024-04-21 22:13:13,033 INFO [loop_until]: (max_time=180, interval=5, expected_rc=[0] 2024-04-21 22:13:18,009 INFO [loop_until]: OK (rc = 0) 2024-04-21 22:13:18,009 DEBUG --- stdout --- 2024-04-21 22:13:18,009 DEBUG {"Status": "OK", "Message": "Monitoring has been started"} 2024-04-21 22:13:18,009 DEBUG --- stderr --- 2024-04-21 22:13:18,009 DEBUG 2024-04-21 22:13:18,009 INFO 2024-04-21 22:13:18,009 INFO *************************************** Dumping pod list *************************************** 2024-04-21 22:13:18,009 INFO Dumping pod list to /mnt/disks/data/xslou/lodestar-fork/results/pyrock/authn_rest/pod-logs/stack/after-lodemon-deployment/_pod-list.txt 2024-04-21 22:13:18,539 INFO Dumping platform-config configmap to /mnt/disks/data/xslou/lodestar-fork/results/pyrock/authn_rest/pod-logs/stack/after-lodemon-deployment/_platform-config_configmap.txt 2024-04-21 22:13:19,878 INFO Dumping pod description and logs to /mnt/disks/data/xslou/lodestar-fork/results/pyrock/authn_rest/pod-logs/stack/after-lodemon-deployment/lodemon-7f68c498d9-gbc5j.txt 2024-04-21 22:13:19,879 INFO Dumping pod description and logs to /mnt/disks/data/xslou/lodestar-fork/results/pyrock/authn_rest/pod-logs/stack/after-lodemon-deployment/lodemon-7f68c498d9-gbc5j.txt 2024-04-21 22:13:19,880 INFO Check pod logs for errors ________________________________________________________________________________ [2024-04-21 22:13:19] Monitoring_start post : Post method ________________________________________________________________________________ Setting result to PASS Task has been successfully stopped