Sophie

Sophie

distrib > Mageia > 7 > aarch64 > by-pkgid > 7e647d9940d31b34c253e6f71c416c4b > files > 2596

bzr-2.7.0-6.mga7.aarch64.rpm

************
Nested Trees
************

:status: 2012-03-17: Draft spec

.. sectnum::

.. contents::

Principles
**********

- Never store a location in versioned data.

- Implementation of nested trees shall not make operations observably slower
  for those not using nested trees.

- A repository that holds a revision R should be able to reconstruct the
  whole contents of that revision, including any nested trees.  Corolary:
  if I fetch that revision, even into a branch that has no working tree, it
  should bring across any referenced revisions, or (implicitly) add fallback
  repositories.

- The introduction or possible support for nested trees should not
  have an impact on performance.

Core Concepts
*************

**Subtree** A tree which is inside another tree, which bzr has been asked to
treat as part of the outer tree.

**Subbranch** The branch associated with a subtree.

**Containing tree** A tree which has another tree inside it

**Tree reference** A directory in a containing tree which contains a subtree.


Basic design approach
*********************
(see "Design decisions" for extended rationale)
By default, APIs and commands for containing trees should behave as though the
subtrees were plain directories.  By default, commands in subtrees should not
affect the containing trees.

Downwards recursion
~~~~~~~~~~~~~~~~~~~
One of the objectives of nested trees is to provide ways of reproducing
historical combinations of different codebases.  The dependency chain points
downwards, such that trees are affected by the revision of their subtrees, but
subtrees are oblivious to their containing trees.  Just as bazaar doesn't
entice people to commit inconsistent trees, it should not entice people to
commit inconsistent combinations of containing tree and subtree.  Therefore,
commit should recurse downwards by default.

Status and diff should reflect what will happen when commit is used, so they
should also recurse downward by default.  Add almost does this already.  With
status, diff, commit and add recursing downwards, it would be confusing to
users if other operations did not.  Therefore, all operations should recurse
downwards by default.

No upwards recursion by default
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
One of the reasons for using nested trees is to gain performance by only
committing in a subtree.  Therefore, operations should not recurse upwards by
default.  However, some users do want to have upwards recursion, so it should
be provided as an option.

Modelling nested trees as a composite tree
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
The idea that a set of nested trees behaves like a single, larger tree seems
relatively easy to grasp.  Both for users and for developers, it provides a
clear expectation for the behaviour of nested trees.  There are no obvious
drawbacks in terms of code clarity or performance.  Therefore, it seems like a
good model to start with.

Using root file-ids for tree-references
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
The idea that tree-reference file-ids are the same as the file-ids of the
corresponding root directories has a nice symmetry.  It is one way of ensuring
that "bzr split" is deterministic, and "bzr join" is deterministic.  When
performing operations involving a tree and a split version of that tree, using
the same file-id makes it easy to ensure that operations such as moves and
renames are applied appropriately to the tree-reference.  Providing mechanisms
whereby a tree-reference can be treated as it would if it had its old file-id
encroaches on the territory of path tokens or file-id aliases.  Having "split"
cause file-id changes means that in comparing these revisions, it would be seen
as a deleting a directory and creating a new tree-reference with the same name.
Handling this correctly in operations such as merge and revert would be more
complicated than if it were treated as a kind change, especially when
unversioned files are present in the subtree.

Sub-branches
~~~~~~~~~~~~
The branches associated with subtrees shall be called "subbranches".

The branch for the top tree will be in a special format, whose last_revision
file lists all the last_revision info for all of the branches associated with
the nested tree.  The .bzr directories of subtrees will have a "branch" that
simply indicates that the top tree's branch should be used.

In the top tree's last_revision file, the revision id and revno will be
provided, indexed by the tree-reference file-id.

The repository used by the top-tree's branch must be a shared repository, and
will be used by the sub-branches.

Only the top branch will have a branch.conf.  When an operation on a subbranch
would normally use values from branch.conf it will look them up in the top
branch's branch.conf and adjust for the sub-location if appropriate.  e.g. "bzr
push" in a subtree will push just that subbranch to the corresponding subbranch
in the configured push location of the top branch.

Rationale
.........
If the branches were not local, the local subtrees might not be committable,
and commits to the remote branch would make the local subtree out of date.
They should not be in a separate location from the containing branch, because
they might share history with the tree-reference's branch.  However, those
local branches should not be at the same location as their tree, because the
tree might be deleted or moved.  Indeed, they should not be anywhere within a
working tree.

subtree branches should not be above or beside their containing branch, because
it could cause terrible confusion if subtrees from two different trees were
updating the same branches with every push, pull, commit and uncommit.

Subtree branches could be plain branches stored somewhere in the top tree's
branch, but then a lookup mechanism would be needed to translate from file_id
to location, and performance with large numbers of subbranches would be poor.

Pull and non-initial push
~~~~~~~~~~~~~~~~~~~~~~~~~
When a pull involves updates to tree references, pull will always pull into the
reference branch.  For all new revisions in the upper branch, it will determine
the revision values of tree references, and fetch them into the repository.

When new tree references are encountered, pull should create a corresponding
subbranch in the top branch.

Pulls will update the subtrees whose tree-references change, including creating
trees for new sub-branches.

Implementation strategies
*************************

Data storage
************

Trees
~~~~~
The root-ids of trees must be unique, so that the same file-id can be used in
both the containing tree and the subtree, to simplify access within trees.
Tree references are an inventory type that is distinct from a directory, and
has a revision-id associated with it.
All modern working trees support tree references.  Indices may be provided to
ensure fast access to the list of subtrees.

The various methods on ``Tree`` need to be updated to handle nested trees.

Tree file ids are tuples containing inventory file ids, describing a path
to the file. This means that if a file is in a nested tree with the root 
fileid ``file_id_a`` and the file itself has the inventory file id ``file_id_b``
then the tree file id is (file_id_a, file_id_b). This makes it easy to look up file ids
without having to load and scan all nested trees for ``file_id_b``.

Branches
~~~~~~~~
A new branch format, "subbranches", is introduced which provides multiple
sub-branches, with their data referenced by file-id.  A new branch refrerence
format, "subbranch-reference", is introduced which refers to sub-branches in a
"subbranches" branch.

Repositories
~~~~~~~~~~~~
Some repository formats have 'subtree' variants, e.g. pack-0.92-subtree,
development-subtree.  These are hidden, experimental formats that support
storing tree-references in their inventory formats.

Repository indexing might be extended to provide fast access to
tree-references.

Commands
********

The following new options are introduced:

``join --reference`` Cause an inner tree to be treated as a subtree.  The outer
tree's branch must be in the new "subbranches" format.  The inner tree's branch
will be cloned into the "subbranches" branch, and the local branch will be
replaced with a "subbranch-reference".  Finally, a tree-reference will be
added to the containing tree.

(this is already implemented)

API Changes
***********

Tree file ids as tuples
~~~~~~~~~~~~~~~~~~~~~~~

Implementation Changes
**********************

Branch changes
~~~~~~~~~~~~~~

 pull recurses into reference branches, and pulls *from* the source's reference
 branches.

Repository changes
~~~~~~~~~~~~~~~~~~

 fetch provides a list of tree-reference revision ids/file-id pairs for the
 revisions that were fetched.  Fetch automatically fetches all revisions
 associted with tree-references that were fetched.

Use Cases
*********

Case 1
~~~~~~
Barry works on a project with three libraries.  He wants to keep up to date
with the tip of those libraries, but he doesn't want them to be part of his
source tree.

Example commands::

 Set up the tree:
 $ bzr branch --nested http://library1 project
 $ bzr branch --nested http://library2 project
 $ bzr branch --nested http://library3 project
 $ bzr commit project -m "Added three libraries"

 Update a library to tip:
 $ bzr pull -d project/library1 http://library1

Case 2
~~~~~~
Now, Barry wants to add a fourth library.

Example commands::

 $ bzr branch --nested http://library4 project

Case 3
~~~~~~
Barry wants to publish his project.

Example commands::

 $ bzr push -d project bzr+ssh://project/trunk

Case 4
~~~~~~
Barry decides to make part of his project into another library

Example commands::

 $ bzr split --nested project/newlibrary

Case 5
~~~~~~
Curtis wants to hack on Barry's project

Example commands::

 $ bzr branch http://project/trunk

Case 6
~~~~~~
Barry wants to drop one of the libraries he was using

Example commands::

 $ rm project/library1
 $ bzr commit project -m "Removed library1"

Case 7
~~~~~~
Curtis has made changes to one of the libraries.  Barry wants to merge Curtis'
changes into his copy.

Example commands::

 $ bzr merge -d project http://curtis.org/trunk/library2

 Or alternatively:
 $ bzr merge -d project/library2 http://curtis.org/trunk/library2

Case 8
~~~~~~
Curtis has made changes to Barry's main project.  Barry wants to merge Curtis'
changes into his copy.

Example commands::

 $ bzr merge -d project http://curtis.org/trunk


Case 9
~~~~~~
Barry makes changes in his project and in a library, and he runs status

Example commands::

 $ echo bar > project/foo
 $ echo qux > project/library2/baz
 $ bzr status project
  M foo
  M library2/baz

Case 10
~~~~~~~
Barry wants to upgrade the bazaar format of his project

Example commands::

 $ bzr upgrade project

Case 11
~~~~~~~
Curtis wants to apply Barry's latest changes.

Example commands::

 $ bzr merge -d project http://project/trunk

Case 12
~~~~~~~
Danilo wants to start a project with two libraries using nested trees from
scratch.

Example commands::

 $ bzr init project
 $ bzr branch --nested http://library4 project
 $ bzr branch --nested http://library5 project
 $ bzr commit project -m "Created new project."

Case 13
~~~~~~~
Edwin has a project that doesn't use nested trees and he wants to start using
nested trees.

Example commands::
 $ bzr split --nested project/subdir

Case 14
~~~~~~~
Françis has a project with nested trees where the containing tree uses one
Bazaar format and the subtree uses a different Bazaar format.

Not supported.

Case 15
~~~~~~~
Barry commits some changes to a library and to the main project, and then
discovers the changes are not appropriate.  He has not yet pushed his changes
anywhere.

Example commands::

 $ bzr merge -d project http://library2
 $ bzr commit project -m "Updated library2"
 $ bzr uncommit project --force

Case 16
~~~~~~~
Barry commits some changes to a library and to the main project, publishes his
branch, and then discovers the changes are not appropriate.

Example commands::

 $ bzr merge -d project http://library2
 $ bzr commit project -m "Updated library2"
 $ bzr push -d project
 $ bzr revert -r-2 project
 $ bzr commit project -m "Reverted inappropriate changes."
 $ bzr push -d project

Case 17
~~~~~~~
Gary is writing a project.  Henninge wants to split a library out of it.

Example commands::

 $ bzr branch project
 $ bzr split project/library6
 $ mv project/library6 .
 $ rm project
 $ bzr commit -m "split library6 into its own library."

Case 18
~~~~~~~
Henning wants to update to receive Gary's latest changes.

Example commands::

 $ bzr merge -d library6

Case 19
~~~~~~~
Gary wants to update to receive Henninge's changes, including splitting a
library out.

Example commands::

 $ bzr split --nested project/library6
 $ bzr commit project -m "Turned library6 into a library"
 $ bzr merge -d project/library6 http://library6
 $ bzr commit project -m "Merge Henninge's changes."


Case 20
~~~~~~~
Gary wants to update to receive Henninge's changes, without splitting a library
out.

 $ bzr split --nested project/library6
 $ bzr commit project -m "Turned library6 into a library"
 # i.e. a cherrypick that skips the revision where library6 became a library.
 $ bzr merge -d project/library6 http://library6 -r 5..-1
 $ bzr commit project -m "Merge Henninge's changes."

Case 21
~~~~~~~

John works on a project called FooBar, but has decided that it would be better
structured as two projects, where Bar is a library that may be of general use
outside of Foo.  As it happens, bar already has its own subdirectory.

He runs:
::

    # Convert into two trees: foobar and foobar/bar.
    # In each tree, files will be removed and deleted.  In foobar/bar, "bar"
    # will have been moved to become the tree root.
    # These changes will be committed later.
    $ bzr upgrade foobar --format=subbranches
    $ bzr split foobar/bar

    # Add a tree-reference from foobar to foobar/bar, change bar's branch
    # to a reference to subbranch in foobar's branch.
    $ bzr join --nested foobar/bar

    # This recurses into foobar/bar and commits the deletion of the containing
    # tree.  In foobar, it commits a kind change for 'bar' from directory to
    # tree-reference, and the deletion of the contents of bar.
    $ bzr commit foobar

This commits new revisions to foobar and bar, and foobar's tree-reference bar
refers to the revision-id of bar.

Next, he adds two new files: foobar/baz and foobar/bar/qux::

    $ vi foobar/baz
    $ vi foobar/bar/qux
    # This adds qux to foobar/bar and adds baz to foobar.
    $ bzr add foobar

Since foobar/bar/qux is in a commitable state and foobar/baz is not, he invokes
::

    $ bzr commit foobar/bar

This commits foobar/bar/baz/qux to the subtree and commits foobar/bar to the
containing tree.

(Had he wanted to commit to just the subtree, or just the containing tree, he
could have specified an option.)


Case 21
~~~~~~~

Robert wants to hack on a project, Baz, that is structured as a nested tree,
which uses the library "quxlib", from quxlib.org.

He runs:
::

    $ bzr branch http://baz.org/dev baz

This creates a "subbranches" branch and working tree for baz, as normal.  Since
tree-references were encountered, it adds subbranches for them to the baz
branch.  All data is retrieved from baz.org, not quxlib.org.

It creates a working tree for quxlib with a subbranch-reference.  It uses the
revision-id from the tree-reference in the containing tree, not the head
revision at baz.org.  This allows Robert to get a known-good nested tree.

Later, Robert decides to update the version of quxlib being used to the latest
from quxlib.org.  He runs::

    $ bzr pull -d http://quxlib.org

This updates the version of quxlib in the working tree, which mean that baz is
now out-of-date with its last-committed tree.  Unfortunately, the new rev on
quxlib is not completely compatible with the old one, and Robert must tweak a
few files before Baz runs properly.  Once he has done so, he runs::

    $ bzr commit baz

Now he has committed a known-good nested tree, and the baz working tree is once
again up-to-date.


User documentation
******************

For many large projects, it is often useful to incorporate libraries
maintained elsewhere or to construct them from multiple subprojects.
While it is easy for a single user to set up a particular layout of
multiple branches by hand, the different branches really need to be
linked together if others are to reproduce the desired layout, and
if the relationships are going to be managed over time.

Bazaar has good support for building and managing external libraries
and subprojects via a feature known as *nested trees*. In particular,
nearly all of Bazaar's commonly used commands understand nested trees
and Do The Right Thing as explained below. The relationship is hierarchical:
the containing tree knows about its nested trees, but nested trees are unaware
of the tree (or trees) containing them.

At the moment, *nested trees* are the only type of nested item
supported though *nested files* may be supported in the future.
Nested trees may contain other nested trees as required.

Note: This feature requires a recent branch format such as ``2.0``
or later.


Nesting an external project
~~~~~~~~~~~~~~~~~~~~~~~~~~~

To link an external project into a branch, use the ``branch`` command
with the ``--nested`` option like this::

  bzr branch --nested SOURCE-URL TARGET-DIR

For example, assuming you already have a ``src/lib`` directory where
libraries are kept::

  bzr branch --nested http://example.com/xmlsaxlib src/lib/sax

This will create a nested branch in the ``src/lib/sax`` directory,
join it into the containing branch and save the source location.

If you now run ``bzr status``, it will show the nested branch as
uncommitted changes like this::

  +  src/lib/sax
  +  src/lib/sax/README
  +  src/lib/sax/parser.py
  ...

To record this change, use the ``commit`` command as you normally would::

  bzr commit -m "added SAX parsing library"

Note that Bazaar stores the tip revision of each nested branch. This
is an important feature in that it's then easy to reproduce the exact
combination of libraries used for historical revisions. It also means
that other developers pulling or merging your changes will get nested
branches created for them at the right revisions of each.


Refreshing a nested branch
~~~~~~~~~~~~~~~~~~~~~~~~~~

As bugs are fixed and enhancements are made to nested projects, you
will want to update the version being used. To do this, ``pull`` the
latest version of the nested branch. For example::

  bzr pull -d src/lib/sax

If the latest revision is too unstable, you can always use the ``-r``
option on the ``pull`` command to nominate a particular revision or tag.

Now that you have the required version of the code, you can make
any required adjustments (e.g. API changes), run your automated tests
and commit something like this::

  view src/lib/sax/README
  (hack, hack, hack)
  make test
  bzr commit -m "upgraded SAX library to version 2.1.3"


Changing a nested tree
~~~~~~~~~~~~~~~~~~~~~~

As well as keeping track of which revisions of external libraries
are used over time, one of the reasons for nesting projects is to
make minor changes. You may want to do this in order to fix and
track particular bugs you need addressed. In other cases, you may want
to make various local enhancements that aren't valuable outside
the context of your project.

As support for nested branches is integrated into most commonly
used commands, this is actually quite easy to do: simply make
the change to the required files as you normally would! For example::

  edit src/lib/sax/parser.py
  bzr commit -m "fix bug #42 in sax parser"

Note that Bazaar is smart enough to recurse by default into nested
branches, commit changes there, and commit the new nested branch tips
in the current branch. Both commits get the same commit message.

If you want to only commit the change to a nested branch for now, you
can change into the nested branch before running commit like this::

  cd src/lib/sax
  bzr commit -m "fix bug #42 in sax parser"

Alternatively, you can use a selective commit like this::

  bzr commit -m "fix bug #42 in sax parser" src/lib/sax


Reviewing nested tree changes
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~

Just like ``commit``, the ``status`` and ``diff`` commands implicitly
recurse into nested trees. In the case of ``status``, it shows both the
nested tree as having a pending change as well as the items within it that have
changed. For example::

   M src/lib/sax
   M src/lib/sax/parser.py

Once again, if you change into a nested tree though, ``status`` and
``diff`` will operate just on that tree and not recurse upwards by
default.


Browsing nested tree history
~~~~~~~~~~~~~~~~~~~~~~~~~~~~

As the branches of nested trees have their own history, the ``log`` command
shows just the history of the containing branch. To see the history for
a nested branch, nominate the branch explicitly like this::

  bzr log src/lib/sax

Note however that ``log -v`` and ``log -p`` on the containing branch
will show what files in nested branches were changed in each revision.


Splitting out a project
~~~~~~~~~~~~~~~~~~~~~~~

If you already have a large project and wish to partition it into
reusable subprojects, use the ``split`` command. This takes an existing
directory and makes it a separate branch. For example, imagine you have
a directory holding UI widgets that another project would like to
leverage. You can make it a separate branch like this::

  bzr split src/uiwidgets

To make the new project available to others, push it to a shared location
like this::

  cd src/uiwidgets
  bzr push bzr://example.com/uiwidgets

You also need to link it back into the original project as a nested branch
using the ``join`` command like this (assuming the current directory is
``src/uiwidgets``)::

  bzr join --nested .
  bzr commit -m "uiwidgets is now a nested project"

Similar to ``branch --nested``, ``join --nested`` joins the nominated directory
(which must hold a branch) into the containing tree.  In order to make sure
that all versions of a tree can be reproduced, the branches of nested trees
share a repository with their containing tree.


Virtual projects
~~~~~~~~~~~~~~~~

By design, Bazaar is strict about tracking the actual revisions used of
nested branches over time. Without this, projects cannot accurately
reproduce exactly what was used to make a given build. There are
isolated use cases though where is advantageous to say "give me the
latest tip of these loosely coupled branches". To do this, create a
small 'virtual project' which is just a bunch of *unpegged* nested
branches. To mark nested branches as unpegged, use the ``--no-pegged``
option of the ``join`` command like this::

  bzr join --nested --no-pegged [DIR]

To stop the nested branch tips from floating and to begin recording
the tip revisions again, use the ``pegged`` option::

  bzr join --nested --pegged [DIR]

After changing whether one or more nested branches are pegged or not, you
need to ``commit`` the branch to record that metadata. (The pegged state
is recorded over time.)

For example, you may be managing a company intranet site as a project
which is nothing more than a list of unrelated departmental websites
bundled together. You can set this up like this::

  bzr init intranet-site
  cd intranet-site
  bzr branch bzr://ourserver/websites/research
  bzr branch bzr://ourserver/websites/development
  bzr branch bzr://ourserver/websites/support
  bzr branch bzr://ourserver/websites/hr
  bzr join --nested --no-pegged research
  bzr join --nested --no-pegged development
  bzr join --nested --no-pegged support
  bzr join --nested --no-pegged hr
  bzr commit -m "initial configuration of intranet-site"

Publishing the overall site is then as easy as going to the server
hosting your intranet and running something like::

  bzr branch http://mymachine//projects/intranet-site

Refreshing the overall site is as easy as::

  bzr pull

Virtual projects are also useful for providing a partial 'view' over
a large project containing a large number of subprojects. For example,
you may be working on an office suite and have a bunch of developers
that only care about the word processor. You can create a virtual
project for them like this::

  bzr init wp-modules
  cd wp-modules
  bzr branch ../common
  bzr branch ../printing
  bzr branch ../spellchecker
  bzr branch ../wordprocessor
  bzr join --nested --no-pegged common
  bzr join --nested --no-pegged printing
  bzr join --nested --no-pegged spellchecker
  bzr join --nested --no-pegged wordprocessor
  bzr commit -m "initial configuration of wp-modules"

Those developers can then get bootstrapped faster and have *just* the
subprojects they care about by branching from ``wp-modules``.


Nested branch tips & tricks
~~~~~~~~~~~~~~~~~~~~~~~~~~~

As explained above, most of Bazaar's commonly used commands recurse
downwards into nested branches by default. To prevent this recursion,
use the ``--no-recurse-nested`` option on various commands (including
``commit``, ``status`` and ``diff``) that support it.

Thanks to plugins like bzr-svn and bzr-git, Bazaar has strong support
for transparently accessing branches managed by foreign VCS tools. This
means that Bazaar can support projects where nested branches are hosted
in supported foreign systems. For example, to nest a library maintained
in Subversion::

  bzr branch --nested svn://example.com/xmlhelpers src/lib/xmlhelpers

If you want revisions to be committed both to a remote location and a
local location, make the top-level branch a bound branch.  (Nested branches
have no configuration of their own.)

Most likely, you will have some branches that are identical to their upstream
version and can be pulled, and some that have local changes and must be merged.
You can update all of them at once using ``merge --pull``.  This will pull
into the trees with no local changes, and merge into the ones with local
changes.  Afterward, you should commit, which will commit only into the
trees that were merged into.

As you'd expect, a nested branch can be moved or deleted using the
normal commands. For example, after splitting out a subproject, you
may want to change its location like this::

  bzr mv src/uiwidgets src/lib/uiwidgets
  bzr commit -m "move uiwidgets into src/lib"

Things to be aware of
~~~~~~~~~~~~~~~~~~~~~

Commands like ``commit`` and ``push`` need online access to the locations
for nested branches which have updated their tip. In particular, ``commit``
will update any changed nested branches first and only commit to the
containing branch if all nested branch commits succeed. If you are working
offline, you may want to ensure you have a local mirror location defined
for nested branches you are likely to tweak. Alternatively, the
``no-recurse-nested`` option to the ``commit`` command might to useful to
commit some changes, leaving the nested branch commits until you are back
online.

At the moment, nested trees need to be incorporated as a whole.
Filtered views can be used to restrict the set of files and directories
logically seen. Currently though, filtered views are a lens onto a tree:
they do not delete other files and the exposed files/directories must
have the same paths as they do in the original branch. In the future,
we may add support for nesting and moving selected files from a
(read-only) nested branch something like this::

  bzr nested DIR --file LICENSE --file doc/README::README
  bzr commit -m "change which files are nested from project DIR"

If you require this feature, please contact us with your needs.

Design decisions
****************

The branches of subtrees shall either share a repository with the containing tree,
or the containing tree's repository will be (implicitly) added as a fallback
repository.

The branches of subtrees shall be in a special format that shares a single
last_revision file that is stored in the containing branch.

The subtree branches shall be referenced in the last_revision file by file-id.

Subtree branches shall not support individual configuration.

Fetch shall automatically fetch the revisions mentioned by tree-references,
recursively.

The reserved revision-id "head:" shall be used in tree-references to refer to
the tip revision of a branch.

bzr-svn repositories with externals shall behave as though the multiple
repositories were a single Bazaar repository with multiple branches.

Shall commands recurse downwards by default?
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~

Yes.

Pros:

 - It is hard to accidentally produce inconsistent trees
 - Inconsistent trees are hard for remote users to handle
 - Accidentally committing too many things at once is easy to resolve
 - It is hard to accidentally commit too many things at once

Cons:

 - Accidentally committing nuclear launch codes is easier to do
 - A commit message that makes sense for the top may not make sense lower down.


Shall commands recurse upwards by default?
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~

No.


Shall subtree branches be addressable?
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~

Ideally, yes. We might want to use the path segment parameters syntax here too.


Shall we model nested trees as a composite tree?
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~

Yes.  Users will see recurse-downwards behaviour that allows operations that
cross subtree boundaries, e.g. a merge in the top tree can move a file between
subtrees.

The downside is that we can't have cheap support for subtrees that are copies
of one another, because we wouldn't know which copy to apply sets of changes
to.


Shall we use root-ids for tree references?
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~

Yes.  This fits well with our current lack of support of file copies.  If we do
support file copies in future it will be possible to change this in a future
format, and perform deterministic upgrades to that format.


What about locking?
~~~~~~~~~~~~~~~~~~~

We should lock recursively.  It matches existing behaviour by failing earlier,
and the extra cost does not seem onerous.  (To be fully efficient this requires
an index of the subtrees, otherwise we need to scan the fully
inventory/dirstate.)

(Also, this decision can be changed later with no compatibility concerns.)


How do we handle merge when the subtree hasn't diverged?
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~

"bzr merge --pull" will be changed so that it will merge (not pull) when the
local last revision's revno would change (i.e. is a non-lhs parent in the merge
source).  This is expected to be the most common way to update nested trees.

The existing "bzr merge --pull" behaviour will be renamed to "bzr merge
--pull-renumber".

"bzr merge" (with no "--pull") will do a merge in all trees.  "bzr pull" will
do a pull in all trees.

The rationale is that a very common use-case is that the top tree is a project
the user is actively committing to, and the subtrees are mainly libraries that
are being mirrored.  So a behaviour that forced every update to be a merge
would be undesirable for the mirrored subtrees, but an update that is a pull
wouldn't suit the changing top tree.  And the existing "merge --pull" (that can
renumber revisions) isn't desireable for either the top tree or subtrees in
this case.


What should uncommit do?
~~~~~~~~~~~~~~~~~~~~~~~~

It will recurse, and subtrees will be uncommitted back to the revision recorded
by the revision the top tree is uncommitting to.

This means that operations like::

   $ echo foo > versioned-file-in-top-tree.txt
   $ bzr ci -m "Change file"
   $ bzr uncommit

will not cause a change in subtrees, since the top-level commit did not affect
them.  But on the other hand:

   $ echo foo > subtree/versioned-file-in-subtree.txt
   $ bzr ci -m "Change file"
   $ bzr uncommit

will first uncommit to the subtree, then to the top tree.  The uncommit will
restore both trees to their previous state.


Some subtrees should have commits and some should not.  How?
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~

We will not provide special support for this initially.  We might later support
flagging some sub-trees as mirror-only or something similar, but this seems like
it could be a general feature not specific to nesting.  (and it may only require
a working tree format bump to add).

Comparison with other systems
*****************************

Git submodules
~~~~~~~~~~~~~~

This allows separate repositories to be used for submodules.

Mercurial Forests
~~~~~~~~~~~~~~~~~

The wiki page does not give confidence that this is a well-maintained project.
It seems similar to config-manager-- its 'snapshot' files are like
config-manager's config files, describing what branches to get and where to put
them, optionally specifying a revision.  No metadata about nesting is stored in
the tree.  Optionally, a 'snapshot.txt' file may be stored in the containing
tree, but it can also be stored somewhere else.

There is no attempt to integrate subtree support into the core commands.

Mercurial Nested Repositories
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
This design was for an integrated feature, but there are apparently 4
implementations as extensions.  While it does integrate subtree support into
core commands, this may be off by default: "The alternative that I lean towards
is to not recurse unless explicitly instructed to. Most probably, only a few
commands should arguably even be aware of modules."

Command comparison:

- hg module add ~= bzr join --nested
- hg module remove ~= bzr remove --keep
- hg module record's functionality is part of nested commits in nested trees.

Like submodules, this stores location information in versioned data: a
.hgmodules directory.
Like submodules and nested trees, particular revisions are recorded.

Subversion "svn:externals"
~~~~~~~~~~~~~~~~~~~~~~~~~~
Like bzr, uses per-file metadata.  Like submodules and nested repositories,
locations are versioned data.  Like Forests, revisions are optional.  Like
nested repositories, there is limited integration into core commands; checkout
and update support externals, and commit may support them in the future.
However, there is no UI specific to creating and updating svn:externals
references.

Unlike all other alternatives, supports partial checkouts.  This is because svn
natively supports partial checkouts.  Also, supports checkouts of tags, because
tags are merely a convention in svn.

Supports pulling in "head" subtrees too, not just specific ("known-good")
revisions.  Nested trees supports this in order to provide high-fidelity
imports.

Some support for single-file svn:externals (see
http://subversion.tigris.org/svn_1.6_releasenotes.html#externals), whereas bzr
subtrees must be directories.

Has a --ignore-externals option for checking out without pulling in the
svn:externals items (svn checkout is more or less equivalent to bzr branch).
You can also use that option on update (more or less like bzr merge).


Comments on differences
~~~~~~~~~~~~~~~~~~~~~~~
externals support partial checkouts.  Nested trees could gain support for this
once Bazaar itself supports partial checkouts.  Supporting a single file as a
"subtree" would also depend on native bzr support.  On platforms that support
symlinks, using symlinks to portions of a subtree can be an effective
substitute.

.. vim: ft=rst