Propagate java-attacher errors to Kibana #7832

axw · 2022-04-11T03:10:21Z

When using the java-attacher, an error (e.g. failure to execute java) should be indicated in Kibana somehow. For example, this might be done by setting the status of the APM integration to degraded.

simitt · 2022-04-11T06:52:47Z

@joshdover are there any plans for adding a more fine-grained health check UI to Fleet where this might fit? I believe @ruflin previously mentioned some early ideas for a per-agent health state that lists all the processes that are supposed to be running.

eyalkoren · 2022-04-11T07:09:02Z

I think that if a policy contains both APM Server and APM Agent configurations (probably only relevant to Java agent now, but hopefully will be relevant to others in the future), we can assume this APM Server is only used for local purposes and simply consider the entire APM integration unhealthy if there is an indication that the agent is unhealthy.

ruflin · 2022-04-11T09:07:12Z

@ph @jlind23 @cmacknz Can you chime in on the status and plans for health reporting?

felixbarny · 2022-04-11T14:26:33Z

After APM Server has discovered the Java installation and before it calls the attacher, it should also validate that the Java installation is working as expected.

Currently, APM Server logs this message when invoking the attacher fails: failed to run java attacher: exit status 1.

Checking whether the Java installation works by invoking java -version (and ideally writing the output to the server logs) helps to show whether there is a general issue with the Java setup or whether something went wrong specifically with the attacher.

jackshirazi · 2022-04-14T12:40:16Z

Tested on Windows. I get the same result, or slightly worse, as it also can't download the requested version:

13:24:09.146 [elastic_agent.apm_server][error] failed to run java attacher: exit status 1
13:24:09.785 [elastic_agent.apm_server][error] Failed to download requested agent version 1.27.1, please double-check your --download-agent-version setting.
13:24:09.824 [elastic_agent.apm_server][error] failed to run java attacher: exit status 1

cmacknz · 2022-04-14T14:19:24Z

> @ph @jlind23 @cmacknz Can you chime in on the status and plans on health.

Improving the agent integration health reporting is tracked under elastic/elastic-agent#100. We are just starting to design what this looks like.

simitt · 2022-05-30T12:29:10Z

Regarding #7832 (comment), it is not yet clear to me whether an integration is also supposed to signal whether the Elastic Agent should try to restart the process when it is reported unhealthy, or whether there will be a more fine-grained indication. A restart by the Elastic Agent would not make sense in the cases described here. @cmacknz can you already share more details on what this will look like, or on the expected timeline for defining the healthcheck work?

cmacknz · 2022-05-30T13:16:34Z

@simitt We have been iterating on the design details. The proposal is Integration Status Health Reporting. It was being reworked a bit last week, but the high-level details are right. I added you to the stakeholder list to make sure you are notified of changes.

The new error reporting mechanism needs to be supported in the agent control protocol. @ph can comment on the timeline for implementing this, but I suspect implementation will start sometime in 8.4.

simitt · 2022-06-03T15:29:01Z

@felixbarny given the above conversation, I don't think it makes sense to implement something in the apm-server before the healthcheck endpoint in the Elastic Agent is defined. What do you think?

felixbarny · 2022-06-03T17:29:30Z

Yes, I agree.
FYI @eyalkoren

axw · 2022-11-15T09:53:14Z

@eyalkoren is looking into splitting the attacher off into its own integration, which would naturally enable surfacing errors. I don't think it makes sense to invest in a lot of changes to Elastic Agent, Fleet, and APM Server in the interim, when we plan to provide a more dedicated integration in the hopefully not too distant future. If needed we can reopen this.

axw added the enhancement label Apr 11, 2022

simitt added the 8.4-candidate label May 30, 2022

simitt removed the 8.4-candidate label Jul 26, 2022

eyalkoren mentioned this issue Sep 7, 2022 in Copy Java attacher jar to a tmp directory #8803 (merged)

axw closed this as not planned Nov 15, 2022
