Troubleshooting¶

This page covers common issues that may arise when using NITA, along with their solutions.

Quick Diagnostics¶

Run these commands first to assess the state of your NITA installation:

# Check pod status
nita-cmd kube pods

# Check for pod errors
kubectl describe pods -n nita

# View pod logs
kubectl logs <pod-name> -n nita

# Check Kubernetes cluster
kubectl cluster-info
kubectl get nodes

Common Issues¶

Pods Not Starting¶

Symptoms: Pods are in Pending, CrashLoopBackOff, or ImagePullBackOff state.

Diagnosis:

kubectl describe pod <pod-name> -n nita
kubectl get events -n nita --sort-by='.lastTimestamp'

Common Causes:

State	Cause	Solution
`ImagePullBackOff`	Cannot download container image	Check internet, DNS, and certificates
`Pending`	PVC not bound or insufficient resources	Check `kubectl get pvc -n nita`
`CrashLoopBackOff`	Container keeps crashing	Check logs with `kubectl logs`
`ContainerCreating`	ConfigMap or volume missing	Check ConfigMaps exist

Jenkins Pod Won't Start¶

Symptoms: Jenkins pod is in CrashLoopBackOff.

Possible Causes:

Missing ConfigMaps — Jenkins requires jenkins-crt and jenkins-keystore:
```
kubectl get cm -n nita | grep jenkins
```
If missing, recreate them (see Certificates).

Corrupted PVC — The jenkins-home PVC may be corrupted:

kubectl delete pvc jenkins-home -n nita
kubectl apply -f /opt/nita/k8s/jenkins-home-persistentvolumeclaim.yaml
kubectl rollout restart deployment/jenkins -n nita

NITA Webapp Returns HTTP Errors¶

Symptoms: 502 Bad Gateway or connection refused.

Possible Causes:

Webapp pod not running:
```
kubectl get pods -n nita | grep webapp
```

Database connection failed:

kubectl logs <webapp-pod> -n nita | grep -i error
kubectl get pods -n nita | grep db

Proxy misconfigured:

kubectl logs <proxy-pod> -n nita
nita-cmd proxy restart

Build Job Fails / Times Out¶

Symptoms: Build action triggered from Webapp fails with timeout.

error: timed out waiting for the condition on jobs/dump
Build step 'Execute shell' marked build as failure

Most Common Cause: Windows CRLF line endings in project files.

Solution:

# Configure git to not convert line endings
git config --global core.autocrlf false

# Re-clone the NITA repository
rm -rf /opt/nita
git clone https://github.com/Juniper/nita.git /opt/nita

Alternative: Install Git for Windows from git-scm.com and choose "Checkout as-is, commit as-is" during installation.

KUBECONFIG Not Set¶

Symptoms: kubectl commands fail with connection errors.

Solution:

export KUBECONFIG=/etc/kubernetes/admin.conf
# Or for non-root users:
export KUBECONFIG=$HOME/.kube/config

Add to ~/.bashrc for persistence:

echo "export KUBECONFIG=$HOME/.kube/config" >> ~/.bashrc
source ~/.bashrc

Certificate Errors During Installation¶

Symptoms: kubeadm init fails with certificate verification error.

[ERROR ImagePull]: failed to pull image...
tls: failed to verify certificate: x509: certificate signed by unknown authority

Cause: Zero-trust security solution (e.g., Zscaler) intercepting traffic.

Solution: See the Certificate Management — Zscaler section.

Kubernetes Certificates Expired¶

Symptoms: kubectl commands fail, API server unreachable.

Solution:

# Check expiration
sudo kubeadm certs check-expiration

# Renew all
sudo kubeadm certs renew all
sudo systemctl restart kubelet

Ansible/Robot Container Already Running¶

Symptoms: Build or test fails because an ephemeral container is already running.

Solution: Wait for the existing job to complete, or delete it manually:

kubectl get jobs -n nita
kubectl delete job <job-name> -n nita

Cannot Access NITA Webapp¶

Symptoms: Browser shows connection refused on https://<host>:443.

Checklist:

✅ Are all pods running? (nita-cmd kube pods)
✅ Is the proxy pod running?
✅ Are the proxy ConfigMaps created?
✅ Are the certificates valid?
✅ Is the firewall allowing port 443?

AlmaLinux firewall:

sudo firewall-cmd --permanent --add-port=443/tcp
sudo firewall-cmd --reload

Ubuntu firewall:

sudo ufw allow 443/tcp

Resetting NITA¶

Restart All Pods¶

kubectl rollout restart deployment --all -n nita

Full Reset¶

# Delete all NITA resources
kubectl delete namespace nita

# Reset Kubernetes
sudo kubeadm reset
sudo systemctl restart containerd.service

# Re-initialize
sudo kubeadm init --control-plane-endpoint="localhost" --ignore-preflight-errors=NumCPU
export KUBECONFIG=/etc/kubernetes/admin.conf

# Remove taints and deploy Calico
kubectl taint nodes --all node-role.kubernetes.io/control-plane-
kubectl apply -f https://raw.githubusercontent.com/projectcalico/calico/v3.26.0/manifests/calico.yaml

# Re-deploy NITA
cd /opt/nita/k8s && bash apply-k8s.sh

# Re-create ConfigMaps (certificates, proxy config, Jenkins keystore)

Troubleshooting¶

Quick Diagnostics¶

Common Issues¶

Pods Not Starting¶

Jenkins Pod Won't Start¶

NITA Webapp Returns HTTP Errors¶

Build Job Fails / Times Out¶

KUBECONFIG Not Set¶

Certificate Errors During Installation¶

Kubernetes Certificates Expired¶

Ansible/Robot Container Already Running¶

Cannot Access NITA Webapp¶

Resetting NITA¶

Restart All Pods¶

Full Reset¶

Getting Help¶