Arrow Ballistics Study | 2026

Slow-Motion Measurement & Aggregation

How each TIFF stack becomes per-frame yaw and flex numbers, how those numbers roll up into per-build trajectories, and what the pipeline still gets wrong.

Overview

This page picks up where the slow-motion capture and preparation page leaves off. By that point each surviving shot is a folder of high-speed frames plus a small set of per-shot calibration values (arrow length in pixels, tape reference line, per-build flex envelope).

What we do here is find the arrow in every frame, turn each frame into a yaw number (how far the nock has swung off the direction of travel) and a flex number (how much the shaft is bowing in the middle), filter out bad frames, summarize each shot down to one number at one bow-relative position, and then average across the three repeats at each (build, torque, distance) cell.

The end product is a per-(build, torque, distance) profile of how yaw and flex evolve in the first ~45 ft of flight. The last section, Limitations and Future Capture Changes, is the honest list of what currently leaks into those numbers.

Per-Frame Detection

For each frame in the shot, the detector produces tip and nock pixel coordinates, the angle of the line between them (the chord), 60 sample points along the shaft, and a valid-or-not flag with a reject reason. Everything below runs once per shot, frame by frame.

Subtract the Background

Almost everything in frame does not move. The tape measure, the table grid lines, dust on the lens, stuck sensor pixels: they sit on the same pixels in every frame. The arrow occupies any given pixel for only a few frames as it sweeps across the field of view.

That gap lets us recover the stationary background cheaply. For each pixel, the color it has across the majority of frames is the background; the arrow only ever shows up as the minority. Taking the median across all frames in the shot produces a clean background image. Every frame is then re-expressed as itself minus that background, recentered to a neutral gray. The transformed frames look like a uniform gray field with the moving arrow and any per-frame lighting noise still visible; the stationary clutter is gone.

Find the Arrow as Motion

On the background-subtracted frames, the arrow's position is found by subtracting each frame from the previous one (frame differencing) and keeping the pixels that changed the most. After a small noise-cleanup pass, any pixel whose change exceeds a threshold becomes part of the motion mask for that frame.

The motion mask is not a clean silhouette of the arrow. A long, uniform stretch of shaft looks almost identical frame to frame, so the only pixels that show up as motion are the tip leaving its previous position and entering a new one, the nock doing the same, and any colored bands along the shaft. The detector treats the motion mask as a sparse cloud of "arrow signal" pixels, not as a solid blob to be cropped.

Two whole-frame guards run before any geometry is attempted:

If more than 10% of the frame's pixels are motion-positive, the frame is rejected as a global lighting change (a flicker or exposure shift). A real arrow never covers anything like 10% of the frame.
If fewer than 60 motion pixels survive, the frame is rejected as too quiet to fit. Tiny isolated specks (under 8 pixels) are dropped as sensor noise before the surviving pixels are handed on.

A straight line is then fit through the surviving pixels using a method that down-weights a handful of stray pixels (shadows, sensor blips) rather than letting them drag the line off course the way an ordinary best-fit line would. The result is the arrow's long axis in this frame. Pixels more than about 8 px perpendicular to that line are dropped as outliers; the rest are the inliers, the motion pixels actually attributable to the shaft. If the inliers do not span at least 40% of the calibrated arrow length, the frame is rejected. Shorter spans almost always mean the arrow is partly out of frame or the line locked onto a small cluster of noise.

Identify the Tip and Nock

Initial tip and nock estimates come from the extreme x-bands of the inlier cloud, with a slightly wider window on the nock side so the fletchings get averaged in rather than tugging the nock inward. Those estimates are biased backward, since frame differencing flags both the previous and current arrow positions as motion. A refinement step re-finds each end in the background-subtracted current frame alone, restricted to a narrow band around the fitted axis, where only the current position contributes.

The tip needs one more correction. The silver field points used in this dataset have low contrast against the table and reflect lights differently every frame, so the refined tip almost always lands at the shaft / field point boundary rather than at the actual front. With the nock trustworthy, the tip is slid outward along the fitted axis until the chord matches the calibrated arrow length: a small (~10 px, the visible field-point length) but systematic correction.

Three sanity checks reject the obviously-bad detections before the frame is marked valid:

Edge buffer.If either endpoint lands within 25 px of any frame edge, the real endpoint is probably off-screen and the chord we measured is not the real arrow length.
Chord length deviation. If the chord length is more than 15% off the calibrated length, the line fit has locked onto unrelated structure.
Chord angle deviation.At 5000 fps the arrow's pose changes by at most a fraction of a degree between frames. If the detected chord angle is more than 5° off the per-shot calibrated chord angle, the fit is mechanically impossible and is rejected.

Frames that fail any of these tests are written out with valid = False and a reject reason; the downstream aggregation skips them.

Sample the Shaft Shape

For frames that pass, the shape of the shaft itself is recovered. The inliers are rotated into a coordinate system where one axis runs along the chord (nock-to-tip) and the other is perpendicular. A gentle curve (a polynomial up to cubic, just enough to capture any bow in the shaft) is fit to the perpendicular offsets as a function of along-chord position. Sixty equally-spaced points are then read off that curve along the middle two-thirds of the chord, staying away from the fletching mass at the nock end and the field-point extrapolation at the tip end.

When a per-build flex envelope is available (from the capture page), a second smoother centerline is computed by fitting a single amplitude to that build's analytic flex shape rather than to a free polynomial, and clipped to the maximum flex the envelope predicts. That envelope-bounded centerline is what gets reported as the per-frame flex value and what shows up in the validation viewers.

Validate Against the Manual Marks

The detector's tip and nock at each of the three hand-annotated frames are compared against the human-marked positions for the same frame. A shot is flagged okif the worst-frame residual is at most 4 px, and annotation_validation_failed otherwise. This catches the cases where the detector silently goes off the rails (a strong shadow, a stationary feature that the median did not subtract, and so on) without anyone having to scroll through the whole shot.

Computing Yaw and Flex per Frame

The per-frame detections from the previous section get turned into the two physical quantities of interest.

Yaw

Yaw is how far off-line the nock has swung from the direction of travel. The direction of travel is taken from the tape reference line built during calibration (the long tape measure laid along the table). For each frame we draw an imaginary line through the tip parallel to the tape and measure the perpendicular distance from the nock to that line. The sign is preserved (positive on one side, negative on the other), so a build that consistently yaws one direction can be told apart from one whose yaw direction wanders:

yaw_px = signed perpendicular distance from nock to (line through tip, parallel to tape)

The value is converted to inches using the per-shot pixels-per-inch ruler from the calibration step.

Flex

Flex is how much the shaft is bowing in the middle. We take the centerline points from the previous section, draw a straight line from the tip to the nock (the chord), and measure the largest perpendicular distance from that chord to any centerline point. The measurement is restricted to the middle of the shaft (t ∈ [0.2, 0.8]) so the fletching mass on the nock end and the tip extrapolation on the tip end do not bias it:

flex_px = largest perpendicular distance from chord to centerline, middle 60% of the shaft

The signed value at the same point is also kept, so direction-of-bow is recoverable. Like yaw, flex is reported in pixels and inches.

Yaw is computed on every detector-valid frame; flex also requires at least half of the 60 centerline points to be accepted. Each shot therefore produces a list of per-frame yaw values and a list of per-frame flex values, indexed by the original cine frame number.

Per-Shot Aggregation

Within a single shot, the per-frame yaw and flex values are filtered to remove residual bad frames, tagged with the physical bow-relative position they correspond to, and summarized into one number per shot.

Frame-Level Filters

Four filters run in series. The first three are automatic; the fourth honors a manually-curated exclusion list described below.

Manual exclusions. Any frame whose (cine-file, frame) pair appears in the exclusions TSV is dropped first. Dropping manual rejections up front matters because the velocity-sanity filter compares adjacent kept frames; if a bad frame were left in, removing it later could implicate its neighbors as outliers.
In-frame check.Frames where the tip or nock falls outside the 1280×832 image are dropped. The detector occasionally marks a frame valid even after one endpoint has actually exited the frame, and those extrapolations produce spurious yaw spikes.
Minimum length. Frames whose detected tip-to-nock distance is less than half the calibrated arrow length are dropped. These are typically frames where a fragmentary fit slipped past the in-detector length check.
Velocity sanity.Adjacent kept frames must look like a real left-to-right flight: no backward motion of more than 5 px per frame and no forward jump of more than 30 px per frame. The actual frame-to-frame displacement at 5000 fps is on the order of 10 to 15 px.

Manual Frame Review

The automatic filters catch obvious detector failures but leave a long tail of frames that pass every threshold and still measure something the detector should not have trusted: a flicker that survived background subtraction and pulled the chord angle off by a degree, a fletching reflection that biased the nock, a frame where one endpoint touched a stationary feature the median did not subtract. Each (build, torque, distance) cell was reviewed by hand against the underlying TIFFs in an internal viewer, and any such frames were added to the manual exclusions list that the filter above consumes on the next pipeline run.

Bow-Relative Position and Per-Shot Summary

The detector reports the tip's x in image pixels. The physical position of the tip relative to the bow at any frame is the camera's bow-to-left-frame-edge distance (from the shot log, in feet) plus the additional distance from the left frame edge to the tip, measured at the arrow's height:

position_ft = distance + tip_x · (arrow_length_in / 12) / arrow_length_px

The second term uses the per-shot pixel arrow length from the calibration step rather than a fixed pixels-per-foot scale. The arrow flies above the table and is therefore closer to the camera than the table is, so its on-screen scale is larger than the table's; using the table scale would underestimate the tip's actual bow-relative position by a few percent.

Visible tip travel within a single shot is usually about 4 ft, less than the 8 ft table width, because the camera was set up so the arrow enters and exits in roughly the time the buffer wants to record. Consecutive camera stations therefore leave small gaps in bow-distance coverage; those show up as missing points rather than interpolated values in the final plots.

The kept frames of each shot produce a per-shot yaw summary (mean, std, n, position_mean, position_min, position_max) and an analogous flex summary. The position range is recorded explicitly because it is the natural horizontal error bar when plotting; each shot covers a stretch of bow-relative distance, not a single point.

Cross-Shot Aggregation

Three shots were fired at every (build, torque, camera station) combination, so the cross-shot step is a mean of those per-shot means within each cell, with the sample standard deviation across shots reported alongside. Two views of the aggregated data come out:

Profile view.Per-shot yaw or flex plotted against bow-relative position, colored by build and torque, with the per-shot position range as a horizontal error bar and the within-shot standard deviation as a vertical error bar. This shows how the metric evolves along the flight: yaw rises sharply over the first few feet as the arrow recovers from archer's paradox and then settles; flex oscillates with the shaft's natural bending frequency, visible as a damped wave in the profile.
Summary table view. Position collapsed into discrete distance bins (the camera-station centers, in 8 ft increments because the table is 8 ft wide), producing a (build, torque) × distance table where each cell is the mean of the per-shot means contributing to that bin.

Both views feed an internal exploration app where build, torque, and distance subsets can be selected without re-running the aggregation, plus a peak- analysis tab that scatters per-shot peak yaw or peak flex against build properties (spine, FoC, point weight, insert weight, arrow weight, launch velocity) to look for cross-build trends.

Limitations and Future Capture Changes

The motion-based pipeline is the best version of this measurement we have to date, but it is not bias-free. Below is what currently leaks into the numbers, what survives the pipeline at the end, and what we would change about the capture setup next round.

Data-Level Issues in the Current Pipeline

Shadows that move with the arrow. The arrow casts shadows on the table that share its shape, orientation, and velocity. Anything that distinguishes the arrow by motion or by shape (including this pipeline) sees the shadow as a second arrow.
Fletchings. The fletchings move with the arrow and sit just inside the nock. They satisfy the same motion and shape criteria as the shaft, so they pull endpoint estimates inward and add mass on one side of the chord, which biases centerline and flex fits.
Brightness flicker between frames. Small lighting changes between high-speed frames cause whole-frame intensity shifts. Frame differencing and background subtraction read those as motion spread across the entire frame. The 10%-of-frame guard catches the worst cases; smaller flickers can still bias a few frames.
Stuck or hot pixels. A handful of sensor pixels sit at a fixed bright or dark value across every frame. Background subtraction keeps them in, where they appear as persistent false detections.
Low-contrast field points. The silver field point reflects differently every frame, so the detected "tip" lands somewhere ambiguous along the field point rather than at its actual front. The slide-to-calibrated- length correction patches this systematically, but the underlying ambiguity is still there.
Stabilizer and other stationary features. The bow stabilizer hangs into the frame and looks much like the flying arrow (same shape, similar size), so the detector can mistake it for a second arrow. Other fixed clutter (the table grid lines in particular) lies at roughly the same vertical position as the arrow's path and can pull a trajectory fit off when the arrow signal is weaker than the clutter.
Inconsistent zoom between shots (lens distortion). The lens was at different zoom levels on different shots. The pixel-to-inch scale itself is fine (it is recovered per shot from the known arrow length), but lens distortion is not. A ChArUco distortion calibration was done at one zoom level and is not valid at the others, so distortion correction cannot be applied consistently across the dataset.
Background grid not in the arrow plane. The grid sits behind the flight plane, so the arrow's apparent position against the grid shifts with viewing angle. Absolute-position measurements that reference the grid are off by the parallax between the two depths.
Inconsistent lighting between shots. Overall lighting and contrast vary noticeably between shots, so thresholds have to be tuned per shot.

Future Capture Changes

The next capture pass would change several things about the setup, most aimed at making the detector's job easier rather than at the detector itself:

Continuous background grid from start to end of the flight, with large numbers marking absolute position relative to the bow.
Orient the background grid so its lines are never aligned with the arrow trajectory.
Ensure the arrow itself is in focus.
Place the background grid as close to the arrow's flight plane as possible.
Fix the lens and zoom so they are identical across all shots.
Account for the camera-to-table angle when setting up the next station so adjacent captures overlap.
Mount the camera much higher (if possible) to capture the entire flight in a single shot, eliminating stitching and reducing the arrow-to-grid parallax.
Capture launch velocity (Garmin) for each shot.
Use a solid nock instead of a translucent one.
Use a solid black field point.
Diffuse or multi-angle lighting to soften the directional shadow that currently mimics the arrow.
Matte, low-reflectivity background in a neutral tone that contrasts cleanly with both bright fletching and dark shaft.
Remove distractions from the frame (bow stabilizer, reflective surfaces, anything else that moves or has strong gradients near the arrow's path), since they create persistent false detections.