Thanks Markus, although I'm not sure I fully understand your reply...
If I try to do a transfer between two agents that are each using a ca.pem file from a different AE (which is a perfectly legitimate scenario), that shouldn't prevent the file transfer from working, should it?
Original Message:
Sent: Feb 11, 2025 04:33 AM
From: Markus Embacher
Subject: random ca.pem errors with file transfers?
Hi Daryl,
I had a chat with @Oana Botez: the certificates you see in /agent/bin/security are internal certificates which the Automation Engine issues. Those certificates are used to encrypt the file transfer between agents. Every AE will have a different internal CA in order to create those internal certificates. That is why you are seeing different ca.pem files for every AE. If you would like to use your own CA for those internal certificates please have a look at:
https://docs.automic.com/documentation/webhelp/english/ALL/components/DOCU/24.3.0/Automic%20Automation%20Guides/Content/AWA/Variables/UC_AGENT_TLS_SETTINGS.htm
The issue you are facing is because of the following reason:
*) agent connects to AE1 and receives ca.pem file to store in /bin/security
*) now you connect the same agent to AE2, the agent does not receive a new ca.pem file, because the existing one is still valid
*) as soon as you run a file transfer the agent uses the existing ca.pem file, but it does not match the CA which AE2 is using and therefore the TLS handshake fails
*) after cleaning up the /bin/security folder the agent receives a new ca.pem file (signed by AE2) and the file transfer works as expected
Hope that help!
Regards, Markus
Original Message:
Sent: Jan 22, 2025 11:57 AM
From: Daryl Brown
Subject: random ca.pem errors with file transfers?
Update:
We've made another discovery here. The file transfer issues seem to occur when the two agents in question had made their initial connections to different AE servers to obtain their *ca.pem files in the agent/bin/security folder. In these cases, the file transfers end up failing because the signatures in these two ca.pem files don't match.
When we've reset these agents (meaning cleared the agent/bin/security folder, reset the public key, and restarted the agent) and confirmed that both source and destination agents are using ca.pem files obtained from the same AE (regardless of which JCP that might have been), then our transfers work.
Now we've since obtained a new multi-server cert listing the names of both of our AEs in the same cert, and deployed this to both of our AEs. I had expected this would result in agents' ca.pem files looking the same regardless of which AE they connect to now (since the cert on those AEs are the same), but we're still having the same problem -- the ca.pem files look different depending on which AE they connect to.
Has anyone else experienced this? Or have any more insight on why the signatures on these ca.pem files would be different?
Original Message:
Sent: Jan 15, 2025 05:04 PM
From: Daryl Brown
Subject: random ca.pem errors with file transfers?
We've been running into this issue for a while now in our 24.3 environment, and are still working with Broadcom Support to get to the bottom of it.
We have a ton of file transfers that we need to do between various agents, and while we've been in the process of validating that everything works before we migrate various workflows into this 24.3 environment, we've been running into a lot of file transfers erroring out like this:
20250110/174713.406 - U00063097 FT '3070179': Initiating connection to Agent 'VMAZ1UTUC4ARC06'.
20250110/174713.407 - U00063013 FT '3070179': Connecting to '10.86.95.12' at port '23020'.
20250110/174713.410 - U02000407 Initiating connection '1' to server using WebSocket URI: 'wss://10.86.95.12:23020/agent'.
20250110/174713.456 - U02000378 Loading certificates from directory: '/opt/aeluat/agent/bin/./security'.
20250110/174713.458 - U02000377 Certificate loaded from file 'NXUVARCS01001_ca.pem'.
20250110/174713.528 - U02000385 Web socket error: 'SSL_HANDSHAKE_EXCEPTION'.
20250110/174713.528 - IOException: javax.net.ssl.SSLHandshakeException: signature check failed
20250110/174713.529 - -> SSLHandshakeException: signature check failed
20250110/174713.529 - -> ExtendedCertificateException: signature check failed
20250110/174713.529 - -> SignatureException: Signature does not match.
20250110/174713.529 - U00000096 SSL Certificate validation error: 'VMAZ1UTUC4ARC06'.
20250110/174713.546 - Root certificate details -> Issued To:'CN=AE Agent Certificate' Issued By:'CN=AE Agent Certificate' Valid to:'Mon Jan 02 12:58:05 EST 2045' Serial:'1014500577571917941685596234770587331478683926940'
20250110/174713.548 - U02000230 FT '3070179': Thread 'Thread[FT3070179,5,main]' ended.
20250110/174713.585 - U00045014 Exception 'java.nio.file.AccessDeniedException: "./security/VMAZ1UTUC4ARC06_ca.pem"' at 'sun.nio.fs.UnixException.translateToIOException():90'.
20250110/174713.586 - U00003620 Routine 'com.uc4.ex.AdminRoutine' forces trace because of error.
20250110/174713.587 - U00003450 The TRACE file was opened with the switches '0000000000000000'.
20250110/174713.928 - U00003449 Output to the TRACE file is finished.
This isn't happening with every pair of agents, but these errors do occur consistently until we take action. What's odd is that those same two agents are able to do file transfers with other agents just fine.
Broadcom Support has indicated that when we see errors like this, we're supposed to stop the agent + delete the files from the agent/bin/security/ folder(s) + reset the agent public key (from client 0) + restart the agent. We've done this, and it has sometimes fixed the issue -- not always -- but we've also seen this "solution" cause other file transfers involving these agents that were previously working to now start throwing similar errors. So I don't have a lot of faith in this solution right now.
Has anyone else encountered these issues in their environment? If so, how did you address it?