Hi Folks,
I hope someone can point me in the right direction to find the root cause of this error. I have an Azure k8s instance of Neo4j, which I installed using helm with minimal custom config. Using azure disk like this:
volumes:
data:
mode: "volume"
volume:
azureDisk:
diskName: "neo4j-disk-stn"
diskURI: "/subscriptions/696xxx-xxx-xxx6-xx-xxxx/resourceGroups/MC\_rg-kube-xxx-westeu-02\_kube-xxx-02\_westeu/providers/Microsoft.Compute/disks/neo4j-disk-stn"
kind: Managed
Devs are using it for evaluation and today I just spent hours troubleshooting trying to get it up and running after the azureDisk was resized to 256GB by the devs.
The pod is in a crash loop doesn't start at all it fails with the following:
Events:
Type Reason Age From Message
---- ------ ---- ---- -------
Normal Scheduled 24m default-scheduler Successfully assigned neo4j-ee-stn/neo4j-ee-stn-release-0 to aks-neo4j-33193140-vmss00002g
Warning Unhealthy 23m (x6 over 24m) kubelet Startup probe failed: dial tcp 10.244.0.15:7687: connect: connection refused
Normal Pulled 23m (x4 over 24m) kubelet Container image "neo4j:4.4.5-enterprise" already present on machine
Normal Created 23m (x4 over 24m) kubelet Created container neo4j
Normal Started 23m (x4 over 24m) kubelet Started container neo4j
Warning BackOff 4m16s (x100 over 24m) kubelet Back-off restarting failed container
SSL is disabled & I have also increased the startupProbe -> failureTreshhold explicitly;
Any ideas how to tackle this one please?
Thanks!
Emil