Jenkins / JENKINS-57240

Only last stage data is sent to influx db from jenkins pipeline

    • Type: Bug
    • Resolution: Unresolved
    • Priority: Major
    • Components: influxdb-plugin, pipeline
    • Labels: None
    • Environment: Jenkins pipeline 2.150.2, InfluxDB 1.6.2

      I have the below code, which reads each stage's data from a Jenkins pipeline project's JSON output and sends each stage's data to InfluxDB.

      Issue: only the last stage's data is sent to InfluxDB, even though I can see the loop iterating over every stage.

      Any suggestions would be helpful.
      //Methods for InfluxData begin
      import groovy.json.JsonSlurperClassic

      //Maps for field-type columns
      myDataField1 = [:]
      myDataField2 = [:]
      myDataField3 = [:]
      //Maps for custom field measurements
      myCustomDataFields1 = [:]
      myCustomDataFields2 = [:]
      myCustomDataFields3 = [:]
      //Maps for tag-type columns
      myDataTag1 = [:]
      myDataTag2 = [:]
      myDataTag3 = [:]
      //Maps for custom tag measurements
      myCustomDataTags1 = [:]
      myCustomDataTags2 = [:]
      myCustomDataTags3 = [:]

      @NonCPS
      def pushStageData() {
          def url_string = "${JENKINS_URL}job/ENO_ENG_TP/job/R421/13/wfapi/describe"
          def replaced = url_string.replaceAll(' ', '%20')
          def get = new URL(replaced).openConnection()
          get.addRequestProperty('User-Agent', 'Mozilla/4.0')
          get.addRequestProperty('Authorization', 'Basic dZXZvceDIwMTk=')
          // Fetch the contents of the endpoint URL
          def jsonText = get.getInputStream().getText()
          // Convert the text into a JSON object using JsonSlurperClassic
          def jsonObject = new JsonSlurperClassic().parseText(jsonText)
          // Extract the details of all stages in this build (size-1 to ignore the post stage)
          for (int i = 0; i < jsonObject.stages.size() - 1; i++) {
              // Populate the field-type columns of the InfluxDB measurement in myDataField1
              myDataField1['result'] = jsonObject.stages[i].status
              myDataField1['duration'] = jsonObject.stages[i].durationMillis
              myDataField1['stage_name'] = jsonObject.stages[i].name
              // Populate the tag-type columns of the InfluxDB measurement in myDataTag1
              myDataTag1['result_tag'] = jsonObject.stages[i].status
              myDataTag1['stage_name_tag'] = jsonObject.stages[i].name
              // Assign field-type columns to the measurement called CustomData
              myCustomDataFields1['CustomData'] = myDataField1
              // Assign tag-type columns to the measurement called CustomData
              myCustomDataTags1['CustomData'] = myDataTag1
              // Push the data into the InfluxDB instance
              try {
                  step([$class: 'InfluxDbPublisher', target: 'jenkins_data', customPrefix: null, customDataMapTags: myCustomDataTags1])
              } catch (err) {
                  println("pushStageData exception: " + err)
              }
          }
      }

       


          Aleksi Simell added a comment - edited

          Time is the primary key in InfluxDB. Currently you write all points at one specific moment in your job instead of spreading them out as you get the data. This causes InfluxDB to overwrite points that have the exact same timestamp (this is a feature of InfluxDB itself, not of the plugin).

          A quick workaround is to sleep for a millisecond after each write to force a new timestamp.
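          The workaround above can be sketched inside the reporter's per-stage loop; the `InfluxDbPublisher` step and the `jenkins_data` target come from the paste above, and `sleep` is the standard Pipeline step:

          ```groovy
          // Inside the per-stage loop: write the point, then wait 1 ms so the
          // next point gets a distinct timestamp and is not overwritten
          step([$class: 'InfluxDbPublisher', target: 'jenkins_data',
                customDataMapTags: myCustomDataTags1])
          sleep(time: 1, unit: 'MILLISECONDS') // forces a new InfluxDB timestamp
          ```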


          sridatta s added a comment -

          Thank you aleksisimell.

          But it seems to be a different issue. I have 20 stages in my Jenkins pipeline, yet only the first stage's data is sent to InfluxDB, and the function above exits after one loop iteration. Any suggestions? Sorry for my earlier interpretation.


          Aleksi Simell added a comment -

          Based on your paste, you're collecting data to `myDataField1` and `myCustomDataFields1`, but never writing those to InfluxDB.

          Is the job completing successfully? If it is, then the only thing I can think of right away is that `jsonObject.stages` does not contain 20 objects, but only 2. But I have no experience with JsonSlurper.
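          If the field map should end up in InfluxDB as well, the publish call in the paste would also need to pass it. A minimal sketch, assuming the plugin's `customDataMap` parameter alongside the `customDataMapTags` already used above:

          ```groovy
          // Pass the field map as well as the tag map, so the collected
          // result/duration/stage_name fields are actually written
          step([$class: 'InfluxDbPublisher', target: 'jenkins_data', customPrefix: null,
                customDataMap: myCustomDataFields1,    // field-type columns
                customDataMapTags: myCustomDataTags1]) // tag-type columns
          ```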


          sridatta s added a comment - edited

          aleksisimell Thank you for the suggestion. The job completes successfully, I see 20 objects in the JSON, and jsonObject.stages.size() returns 20. Is this related to the @NonCPS annotation? If I comment it out, the loop iterates over all stages, but I get a java.io.NotSerializableException.
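          This symptom matches a known Pipeline constraint: steps such as `step(...)` cannot be invoked reliably from inside a `@NonCPS` method, while dropping the annotation leaves non-serializable objects (the open connection, the slurper) in CPS scope. A common restructuring, sketched here without having been verified against the reporter's job, keeps only the fetch and parse in the `@NonCPS` method and runs the publish loop in ordinary CPS code:

          ```groovy
          import groovy.json.JsonSlurperClassic

          // @NonCPS part: fetch and parse only; return a plain, serializable list
          @NonCPS
          def fetchStageData(String urlString) {
              def conn = new URL(urlString.replaceAll(' ', '%20')).openConnection()
              def json = new JsonSlurperClassic().parseText(conn.getInputStream().getText())
              // Copy just the primitive values needed downstream into plain maps
              json.stages.collect { [name: it.name, status: it.status, duration: it.durationMillis] }
          }

          // CPS part: loop and invoke the pipeline step here, outside @NonCPS
          def stages = fetchStageData("${JENKINS_URL}job/ENO_ENG_TP/job/R421/13/wfapi/describe")
          for (s in stages) {
              step([$class: 'InfluxDbPublisher', target: 'jenkins_data',
                    customDataMapTags: ['CustomData': [stage_name_tag: s.name, result_tag: s.status]]])
          }
          ```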


          Aleksi Simell added a comment -

          In your global configuration, have you checked the "Job scheduled time as timestamp" checkbox? If so, then you will have only 1 timestamp, which causes you to overwrite your data every time you write to InfluxDB. Unchecking this and adding a short wait will force a new timestamp for each write you do.


          z cloud added a comment -

          aleksisimell, following up on sridattasp's report: in the "jenkins_data" table, each build ends up with multiple duplicate rows. Is there any way to generate a single record in the "jenkins_data" table each time, and multiple records in the "jenkins_custom_data" table?


          Aleksi Simell added a comment -

          zyun823 That is currently not possible. You're only able to add keys and values to a single "jenkins_custom_data" measurement. I can check how big the workload would be to change the functionality so that you would be able to send multiple measurements for the same custom data in a single call from InfluxDbPublisher.


          z cloud added a comment -

          aleksisimell, thank you for your reply; I look forward to this feature. It would be great if customDataMap and customDataMapTags supported a list of maps. For example:

          stageDataMapTagsList = []
          stageDataMapTags = [:]
          stageDataMapTagsList.add(stageDataMapTags)
          customDataMapTags['stage'] = stageDataMapTagsList


          Raz added a comment - edited

          How is batch processing represented using the Jenkins influxDBPublisher?

          In your GitHub repo I saw that there's an array of JUnit test results being processed: https://github.com/jenkinsci/influxdb-plugin/blob/8244a8d0a703f66778a1c8a64ed26d0622f334dd/src/main/java/jenkinsci/plugins/influxdb/generators/JUnitPointGenerator.java Can I use a similar API that accepts custom data per test?

          From JUnitPointGenerator:

          return points.toArray(new Point[0]);

          and from the write path:

          private void writeToInflux(Target target, InfluxDBClient influxDB, List<Point> pointsToWrite) {
              /*
               * build batchpoints for a single write.
               */
              try {
                  influxDB.getWriteApiBlocking().writePoints(pointsToWrite);

          For one thing, is this API exposed for Jenkins pipeline? How can I use it in a Jenkinsfile? Also, is each of these points stored in InfluxDB as its own row?

          With thousands of tests it's not clear what overload of InfluxDBPublisher() I should use for writing multiple rows in the DB for a single data map.

          My current use case is something like

          testData.each {data ->
            influxDbPublisher( selectedTarget: 'jenkins_data', customData: data)
          }

          However, with hundreds to thousands of tests this takes forever (literally 20+ minutes). I'd like an equivalent invocation for batch publishing, as described in the InfluxDB documentation: https://www.influxdata.com/glossary/batch-processing-explained/


          Aleksi Simell added a comment -

          Hi zargold,

          I was about to write instructions on how to do this with CustomDataMap, but looking at the code and your suggestion, I'd say this is not supported for custom data. Robot Framework results, for example, process sub-points individually, and similar functionality could indeed be added to CustomDataMap data generation.


          Raz added a comment -

          Hi aleksisimell,

          So to be clear if I use JUnit: I would indicate a junit.xml file to process.
          It would process my JUnit output and convert it to Points that look something like:

          [{
          test_name: "convertMoney: it converts numbers to dollars",
          test_duration: 0.23,
          test_status: "pass"
          }, {
          test_name: "convertMoney: it converts dollars to numbers",
          test_duration: 0.54,
          test_status: "pass"
          }...5000 unit tests]

          When storing the JUnit data it will:

          • store all 5000 unit tests ("points") in a single/fast API request to InfluxDB
          • Each unit test ("point") will be stored in its own row in InfluxDB.

          So that I can query something like:

          SELECT TOP 3 * FROM "jenkins_custom_data" WHERE test_duration <> nil ORDER BY test_duration DESC;
          

          to get the 3 slowest tests?
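          As written, that query mixes SQL and InfluxQL; InfluxQL's `TOP()` selector can express the same intent. A sketch, assuming `test_duration` is stored as a field in the `jenkins_custom_data` measurement:

          ```sql
          -- Three largest test_duration values, with the matching test name
          SELECT TOP("test_duration", 3), "test_name" FROM "jenkins_custom_data"
          ```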

          But there's no way to provide custom "Points" and batch-send those to be stored each as its own row, right?

           

           

           


            Assignee: Aleksi Simell (aleksisimell)
            Reporter: sridatta s (sridattasp)
            Votes: 1
            Watchers: 5