Note: this article refers to “git pull -r” and “git pull –rebase” interchangeably. They are the same command, except the merge-preserving variation can only be specified via the long form:
git pull --rebase=preserve
I’ve long known that “git pull –rebase” reconciles the local branch correctly against upstream amends, rebases, and reorderings. The official “git rebase” documentation attests to this:
‘Note that any commits in HEAD which introduce the same textual changes as a commit in HEAD..<upstream> are omitted (i.e., a patch already accepted upstream with a different commit message or timestamp will be skipped).’
Thanks to the git patch-id command it’s easy to imagine how this mechanism might work. Take two commits, look at their patch-ids, and if they’re the same, drop the local one.
But what about squashes and other force-pushes where
git patch-id won’t help? What does “git pull -r” do in those cases? I created a series of synthetic force-pushes to find out. I tried squashes, merge-squashes, dropped commits, merge-base adjustments, and all sorts of other force-push craziness.
I was unable to confuse “git pull –rebase,” no matter how hard I tried. It’s bulletproof, as far as I can tell.
Investigating ‘git pull –rebase’
The context here is not a master branch that’s advancing. The context is a feature branch that two people are working in parallel, where either person might force-push at any time. Something like this:
git init echo 'a' > a; git add .; git commit -m 'a' echo 'b' > b; git add .; git commit -m 'b' echo 'c' > c; git add .; git commit -m 'c' git checkout -b feature HEAD~1 echo 'd' > d; git add .; git commit -m 'd' echo 'e' > e; git add .; git commit -m 'e' echo 'f' > f; git add .; git commit -m 'f' git checkout -b gabriel/feature echo 'gf' > gf; git add .; git commit -m 'gf' --author='Gabriel Lajeunesse <email@example.com>' git checkout -b evangeline/feature HEAD~1 echo 'ef' > ef; git add .; git commit -m 'ef' --author='Evangeline Bellefontaine <firstname.lastname@example.org>' git push --mirror [url-to-an-empty-git-repo]
In each scenario Evangeline rewrites the history of origin/feature with a force-push of some kind, usually incorporating her own ‘ef‘ commit into her push. Meanwhile, Gabriel has already made his own ‘gf‘ commit to his local feature branch. For each scenario we want to see if Gabriel can use “git pull –rebase” to correctly reconcile his own work (his ‘gf‘ commit) against Evangeline’s most recent push.
- We assume Gabriel has correctly setup remote tracking for his local feature branch. This is a reasonable assumption, since git sets this up by default when a user first types “git checkout feature”.
- We only tested Git v2.14.1 and Git v1.7.2 for this experiment. Perhaps “git pull –rebase” behaves differently in other versions.
- Important: we only use “git pull –rebase” (or -r). Some people claim “git fetch; git rebase origin/master” is equivalent to “git pull -r”, but it isn’t.
For each scenario we are on Gabriel’s local branch feature. The graph on the left shows both the state of origin/feature (thanks to Evangeline’s force-push), as well as the state of Gabriel’s local feature and how it relates to Evangeline’s force-push. The graph on the right shows the result of Gabriel typing “git pull -r”.
A scenario is deemed successful if “git pull -r” results in Gabriel’s ‘gf‘ commit sitting on top of origin/feature. Since Gabriel does not push back in these scenarios, his ‘gf‘ commit remains confined to his local feature branch.
- origin/feature rebased (against origin/master)
This is the canonical example of why we prefer “git pull -r”. The rebase notices that older commits ‘d‘, ‘e‘, and ‘f‘ on Gabriel’s feature branch are patch-identical to the rebased ones on origin/feature, and thus it only replays the final ‘gf’ commit.
git pull -r
- origin/feature squash-merged (with origin/master)
This is the rebase + squash combo meal. Evangeline takes all work on feature, squashes it down to a single commit, and rebases it on top of origin/master. She probably did this via “git merge –squash.” I did not expect “git pull -r” to be able to handle this, but I was wrong.
git pull -r
- origin/feature squashed in-place
This is the classic squash. Evangeline types “git rebase –interactive origin/master”. In the interactive screen she marks the first commit as “pick” and every other commit as “squash” or “fixup”. This squashes feature down to a single commit, but leaves the merge-base alone (commit ‘b‘ in this case). I also did not expect “git pull -r” to handle this one, but I was wrong here, too.
git pull -r
- origin/feature dropped a commit
For some reason Evangeline decided she wanted to drop commit ‘e‘ from origin/feature. She ran “git rebase –interactive origin/master” and marked every commit as “pick,” except commit ‘e‘, which she marked with “drop”. I expected “git pull -r” to erroneously bring commit ‘e‘ back. I was wrong. Running “git rebase” instead of “git pull -r” did bring commit ‘e‘ back, and so there is obviously some deeper intelligence inside “git pull -r” enabling the correct behaviour here.
git pull -r
- origin/feature lost their mind
I have no idea what Evangeline was trying to do here. If you look closely, you’ll see she reversed her branch (‘ef’ is now the oldest commit), she squashed the middle two commits, and she adjusted the merge-base so that origin/feature emerges from commit ‘a‘ on the mainline instead of commit ‘b‘. This is one serious force-push! I had no idea what to expect here. I certainly did not expect “git pull -r” to nail it, but it did.
git pull -r
- origin/feature went back to how things were (undoes the rewrite)
Evangeline, either through her reflog or her photographic memory, happened to remember that origin/feature previously pointed to commit ‘325a76a‘. Here she force-pushes origin/feature back to ‘325a76a‘ to undo her push from scenario 5. The command to do that is useful to know: “git push –force origin 325a76a:refs/heads/feature”. Staring in awe at how “git push -r” did the right thing for scenario 5, all I could do was continue to stare when it did the same here. (Note: Gabriel’s start-state here is scenario 5, not the original start-state).
git pull -r
Dropped Merges: Since “git pull -r” is a rebase, it drops all local merges during reconciliation. This is usually what you want: why keep a bunch of pointless sync-merges around? They just add noise and no value to the commit graph. But sometimes you do want to keep a merge. When you do, you can try git pull’s merge-preserving variation:
git pull --rebase=preserve
Stash and Stash-Pop: The “git rebase” command refuses to run if your worktree is dirty, whereas default “git pull” will proceed as long as incoming changes have no conflicts with unstaged edits. This means “git pull” will run in many situations where “git pull -r” will refuse because of the dirty worktree. The solution it to stash and then stash-pop, either manually, or via the “–autostash” flag:
git pull -r --autostash
Conflicts, a.k.a. Rebase Hell: Rebase hell happens when several commits on your branch edit the same area, and upstream also touched the same area. The problem occurs because each conflict resolution will itself conflict with the subsequent commit in the series. And since the conflict markers tend to touch the same areas again and again, it feels like an infinite hall of mirrors, and makes you lose your mind.
If you use “git pull -r”, you will eventually experience rebase hell. Here’s some tips for surviving it:
- Single-commit branches are immune from rebase hell. Rebasing a branch with only a single commit can trigger at most a single conflict resolution.
- If your branch has many commits, and you find yourself in rebase hell, try aborting, squashing, and then rebase. Squashing is always a viable way out of rebase hell.
- Personally, I’ve never investigated the “git rerere” command, but it’s also another tool available to help with rebase hell.
Conclusion: Time To Revise The Golden Rule
Supposedly, the golden rule of git is to never force-push public branches.
Of course I would never force-push against ‘master’ and ‘release/*’. As a git admin, that’s always the first config I set for a new repo: disallow all rewrites for ‘master’ and ‘release/*’.
But all public branches? I find force-pushing feature branches incredibly useful.
Industry has arrived at a compromise: defer the rewrite to the final merge. Bitbucket, Gitlab, and Github now offer “rebase” and “squash” flavours of PR merge. But it’s a silly compromise, because the golden rule itself is silly. Instead of building complex merge machinery to dance around the golden rule, I think we’d be better served by reworking the rule itself. Three reasons:
- Force-pushes are useful! Public amends, squashes, and rebases help us make better PR’s for code review.
- What is the actual point of the golden rule? Are we trying to prevent lost work on the mainlines (e.g., ‘master’ and ‘release/*’)? If that’s the point, then we’re much better off setting appropriate branch permissions on our central git server for those branches.
- Is the point to prevent the spaghetti graphs caused by default “git pull” behaviour? In that case a better golden rule would be never use default “git pull” and always use “git pull –rebase”, since it avoids spaghetti graphs, while allowing history rewrites.
I propose a new golden git rule (in haiku form):
We never force-push master
or release. But always,
for all branches: git pull -r
Alternatively, you can make “git pull -r” the default behaviour:
git config --global pull.rebase true