Automic Workload Automation

Expand all | Collapse all

Agent loosing connectivity with AE server

  • 1.  Agent loosing connectivity with AE server

    Posted 01-22-2018 04:55 AM
    Hi All,
    I am trying to understand what causes transfer key to expire on agents and how check for key expiry before
    starting a job so that the job doesn't get hung.

    The reason for asking the question is we recently pushed a job to around 95 agents which were all started and running however 
    the jobs got blocked on at least 10 agents, when investigated we found below error in agents log file.

    U02000099 Transfer key could not be loaded. Please check, if the KeyStore exists and the agent is authenticated. 

    As far as I know nothing really changed on these servers.

    Appreciate inputs.

    Zafar



  • 2.  Agent loosing connectivity with AE server

    Posted 01-22-2018 05:40 AM
    Hi

    here is a hint how to get rid of this message.

    https://community.automic.com/discussion/8186/agent-wont-start-because-of-u02000099-transfer-key-could-not-be-loaded

    Background: in the transfer key file (kstr) some infos about the target OS system are stored.

    if they change - e.g. the agent is moved to another OS server the transfer key has to be renewed in clt. 0 in AE.

    this is a security feature to prevent "unwanted" Agents connecting to a AE system.

    cheers, Wolfgang



  • 3.  Agent loosing connectivity with AE server

    Posted 01-22-2018 07:56 AM
    Thanks for the response FrankMuffke

    I know we can transfer the key to re-establish the connection.  I am looking for some information on determining the agents with expired keys and exclude them from job execution list, that way we can prevent the job from getting blocked.

    Additionally, can we trace the change causing the key to expire through logs as I don't see any of the reasons alluded in the link posted to be the cause.

    Thanks in advance,
    Zafar



  • 4.  Agent loosing connectivity with AE server

    Posted 01-22-2018 08:09 AM
    You can use System overview / Agents - "Authenticated" column for this request

    cheers, Wolfgang


  • 5.  Agent loosing connectivity with AE server

    Posted 01-22-2018 08:18 AM

     

    To overcome the problem of blocked jobs you could build Agent Groups from the agents and specify the agent group as the host for the jobs.  

     

    If the transfer key needs to renewed,  the host in the agent group  is not available, so the workload gets dispatched to the next available agent (according to mode ) in the group.

     



  • 6.  Agent loosing connectivity with AE server

    Posted 01-23-2018 04:30 AM
    Thanks Gabor_Szilagyi_7654,

    We will give it a try.

    Zafar