ARD06: Deployment and Monitoring
Status
Investigating
Context
The technical infrastructure should be monitored so anomalies can be detected and resolved. Those that cannot (or should not) be resolved automatically should be reported.
Decision
No decision yet. Leaning toward using an AI agent(s). The n8n platform provides very powerful capabilities.
Rationale
The n8n platform can run orchestrated workflows that reach out to interact with external systems to query their status, and make changes or issue commands to correct, if possible, any problems detected. It can also assist with recovering servers that have failed. The idea is to create detailed instructions (i.e. prompts) that guide the agent to complete tasks and achieve goals.
Consequences
The intended consequences are to monitor infrastructure to the extent possible, and when a problem is recognized either fix it or, if fixing it is not possible, escalate the problem to the attention of a human that can fix it