Summary of my problem: One of my private repositories on Bitbucket suddenly more than doubled in size after I pushed an addition of a few hundred bytes to two existing files. The repo is now over 2GB, which has caused Bitbucket to put it into read-only mode. Because it is in read-only mode, I cannot push changes that would reduce the repo size. (Catch 22.)
Details: My company recently began hosting git repositories on Bitbucket. One of the repositories I am in charge of had a size of about 973MB, which was uncomfortably close to the 1GB soft limit. To reduce the repo size, I followed the instructions in the Bitbucket documentation article Split a repository in two and moved about 450MB worth of documentation and online help files into their own private repo. I then followed the instructions in the Bitbucket documentation articles Reduce repository size and Maintaining a git repository , specifically:
git count-objects -vH
showed me a size-pack of about 973MB.
I ran git filter-branch --index-filter 'git rm --cached --ignore-unmatch doc' HEAD
to remove the doc directory (which is the content I'd moved to the new repo).
I ran the following commands to expire reference and prune:
git for-each-ref --format="%(refname)" refs/original/ | xargs -n 1 git update-ref -d
git reflog expire --expire=now --all
git gc --prune=now
git count-objects -vH
then showed me a size-pack of 881.1 MiB and du -sh .git/objects
returned 882M. I was disappointed that moving over 450MB reduce the repo size by less than 90MB, but pushed the changes to Bitbucket nevertheless:
git push --all --force
git push --tags --force
The settings page for the Bitbucket copy of the repo continued to show a size of 973MB. I logged out, refreshed the browser, logged back in, but that didn't help -- the repo size remained at 973MB.
This morning (three days after the changes described above) I made a couple of minor additions to two existing files which increased the files' sizes by a total of less than 1KB, added and commited them to my local repo, then pushed the change to Bitbucket. A few minutes later I took a look at the Bitbucket page for the repo and saw a red warning banner informing me "This repo is over the 2 GB limit and is in read-only mode." The settings page now says the repo has a size of 2.3 GB.
The push of a few hundred bytes added to two files was definitely the only activity to occur on the remote repo in the last three days, according to Bitbucket. That push may not have been the cause of the repo more than doubling in size, but the two events were closely correlated in time.
git reflog show
returns nothing.
Cloning a new copy into an alternate directory, then running git count-objects give me a size-pack of 881.29 MiB.
The local repository is on a CentOS 6.5 system. git version is 1.8.5.3.
Questions
I've found that the easiest way to reduce the Bitbucket repo size if you are over the 2GB limit is to
This should trigger Bitbucket to run git gc
on the repo.
After conferring with Bitbucket technical support, I can now answer some of my own questions:
First of all check the repository size in your local using the following command :-
git count-objects -Hv
We can use following commands
git reflog expire --expire="1 hour" --all
git reflog expire --expire-unreachable="1 hour" --all
git prune --expire="1 hour" -v
git gc --aggressive --prune="1 hour"
Now , again use the command git count-objects -Hv
to notice the change in the size and garbage of repository
This is a known bug on bitbucket cloud side, see BCLOUD-19794 .
Garbage file is intermittently counted in the repository size.
When pushing to the remote repository a GC is triggered afterwards which generates a garbage file. This garbage file is cleared on the next subsequent GC. Between those two GC's the size of the repository is displayed incorrectly within Bitbucket UI as the garbage file size is intermittently counted towards the repository total size.
As noted in the workaround section, you need to contact bitbucket to manually run the GC.
Bitbucket might take action sooner rather than later if enough people go vote for it.
As I am sure those familar with got already know, but git stores your version history for files, so making changes and pushing files will not reduce your repo size.
There are still several ways to reduce repo sizes on bitbucket, GitHub, gitlab, etc. The best way is to delete branches, as that permanently deletes any files being recorded by that branch, as long as it is not being tracked by another. But you may want the latest files in that branch, so do the following:
--cached
to delete remote branch.Depending on host, you may have to run special commands, but this should work in most cases.
The technical post webpages of this site follow the CC BY-SA 4.0 protocol. If you need to reprint, please indicate the site URL or the original address.Any question please contact:yoyou2525@163.com.