Although variance is the most mathematically reasonable, it is possibly a little technical for many.
I'd actually intended a simpler-to-interpret measure of average byte change whilst ignoring whether its +ve or -ve. For example:
- edit A, +2 bytes
- edit B, +3 bytes
- edit C, -4 bytes
- edit D, -5 bytes
Average
Average change
Average addition size
accounting only for addition edits A and B
Average reduction size
accounting only for deletion edits C and D
For the four edits above, the average is -1, which is tricky to interpret. Do they just make small deletions? Do they make large deletions and additions that happen to almost net cancel? By splitting the additions and reductions, it becomes possible to see whether they e.g. mostly make lost of small additions with the occasional massive deletion.