Okay I think I've gotten through window/content processes

microsoft · Aug 5, 2020 · 3f7688d · 3f7688d
1 parent 69abf3c
commit 3f7688d
Showing 1 changed file with 140 additions and 44 deletions.
diff --git a/doc/specs/#5000 - Process Model 2.0/#5000 - Process Model 2.0.md b/doc/specs/#5000 - Process Model 2.0/#5000 - Process Model 2.0.md
@@ -32,6 +32,59 @@ browser process model. For a web browser, there's often a seperate process for
 each of the tabs within the browser, to help isolate the actual tabs from one
 another.
 
+## Background
+
+In order to better understand some of the following technical solutions, it's
+important to understand some of the technical hurdles that need to be overcome.
+
+### Mixed admin and unelevated clients in a single window
+
+Let's presume that you're a user who wants to be able to open an elevated tab
+within an otherwise unelevated Terminal window. We call this scenario "mixed
+elevation" - the tabs within the Terminal can be running either unelevated _or_
+elevated client applications.
+
+It wouldn't be terribly difficult for the unelevated Terminal to request the
+permission of the user to spawn an elevated client application. The user would
+see a UAC prompt, they'd accept, and then they'd be able to have an elevated
+shell alongside their unelevated tabs.
+
+However, this creates an escalation of priviledge vector. Now, there's an
+unelevated window which is connected directly to an elevated process. At this
+point, any other unelevated application could send input to the Terminal's
+`HWND`, making it possible for another unelevated process to "drive" the
+Terminal window and send commands to the elevated client application.
+
+### Drag and drop tabs to create new windows
+
+Another important scenario we're targeting is the ability to drag a tab out of
+the Terminal window and create a new Terminal window. Obviously, when we do
+this, we want the newly created window to be able to persist all the same state
+that the original window had for that tab. For us, that primarily means the
+buffer contents and connection state.
+
+However, _how_ do we move the terminal state to another window? The terminal
+state is all located in-memory of the thread that created the `TermControl`
+hosting the terminal buffer. It's fairly challenging to re-create this state in
+another process.
+
+Take a look at [this
+document](https://github.com/windows-toolkit/Sample-TabView-TearOff) for more
+background on how the scenario might work.
+
+There's really only a limited selection of things that a process could transfer
+to another with a drag and drop operation. If we wanted to use a string to
+transfer the data, we'd somehow need to serialize then entire state of the tab,
+it's tree of panes, and the state of each of the buffers in the tab, into some
+sort of string, and then have the new window deserialize that string to
+re-create the tab state. Consider that each bufer might include 32000 rows of
+80+ characters each, each with possibly RGB attributes, and you're looking at
+30MB+ of raw data to serialize and de-serialize per buffer, minimum. This sounds
+extremely fragile, if not impossible to do robustly.
+
+What we need is a more effective way for seperate Terminal windows to to be able
+to connect to and display content that's being hosted in another process.
+
 ## Solution Design
 
 ### Window and Content Processes
@@ -48,37 +101,39 @@ will be referred to as the "Window Process" and the "Content Process".
 
 These two types of processes will work together to present both the UI of the
 Windows Terminal app, as well as the contents of the terminal buffers. A single
-WP may be in communication with multiple CPs - one per terminal instance. That
-means that each and every `TermControl` in a WP will be hosted in a seperate
-process.
+window process may be in communication with multiple content processes - one per
+terminal instance. That means that each and every `TermControl` in a window
+process will be hosted in a seperate process.
 
-The WP will be full of "thin" `TermControl`s - controls which are only the XAML
-layer and a WinRT object which is hosted by the CP. These thin `TermControl`s
-will recieve input, and have all the UI elements (including the
-`SwapChainPanel`) of the `TermControl` today, but the actual _rendering_ to the
-swap chain, and the handling of those inputs will be done by the content
-process.
+The window process will be full of "thin" `TermControl`s - controls which are
+only the XAML layer and a WinRT object which is hosted by the content process.
+These thin `TermControl`s will recieve input, and have all the UI elements
+(including the `SwapChainPanel`) of the `TermControl` today, but the actual
+_rendering_ to the swap chain, and the handling of those inputs will be done by
+the content process.
 
 As a broad outline, whenever the window wants to create a terminal, the flow
 will be something like the following:
 
-1. A WP will spawn a new CP, with a unique ID.
-2. The WP will attach itself to the CP, indicating that it (the WP) is the CP's
-   hosting window.
-3. When the CP creates it's swap chain, it will raise an event which the WP will
-   use to connect that swap chain to the WP's `SwapChainPanel`.
+1. A window process will spawn a new content process, with a unique ID.
+2. The window process will attach itself to the content process, indicating that
+   it (the window process) is the content process's hosting window.
+3. When the content process creates it's swap chain, it will raise an event
+   which the window process will use to connect that swap chain to the window
+   process's `SwapChainPanel`.
 4. The will process output from the connection and draw to the swap chain. The
-   contents that are rendered to the swap chain will be visible in the WP's
-   `SwapChainPanel`, because they share the same underlying kernel object.
+   contents that are rendered to the swap chain will be visible in the window
+   process's `SwapChainPanel`, because they share the same underlying kernel
+   object.
 
-The CP will be responsible for the terminal buffer and other terminal state (the
-core `Terminal` object), as well as the `Renderer`, `DxEngine`, and
-`TerminalConnection`. These are all being combined in the CP, as to maximize
-performance. We don't want to have to hop across the process boundary multiple
-times per frame, so the renderer must be in the same process as the buffer.
-Similarly, we want to be able to read data off of the connection as quickly as
-possible, and the best way to do this will be to have the connection in the same
-process as the `Terminal` core.
+The content process will be responsible for the terminal buffer and other
+terminal state (the core `Terminal` object), as well as the `Renderer`,
+`DxEngine`, and `TerminalConnection`. These are all being combined in the
+content process, as to maximize performance. We don't want to have to hop across
+the process boundary multiple times per frame, so the renderer must be in the
+same process as the buffer. Similarly, we want to be able to read data off of
+the connection as quickly as possible, and the best way to do this will be to
+have the connection in the same process as the `Terminal` core.
 
 
 #### Technical Details
@@ -116,32 +171,49 @@ auto myClass = create_instance<winrt::MyClass>(MyClassGUID, CLSCTX_LOCAL_SERVER)
 
 We're going to be using that system a little differently here. Instead of using
 a GUID to represent a single _Class_, we're going to use the GUID to uniquely
-identify _CPs_. Each CP will recieve a unique GUID when it is created, and it
-will register as the server for that GUID. Then, any WP will be able to connect
-to that specific CP strictly by GUID. Because each GUID is unique to each
-content process, any time any client calls `create_instance<>(theGuid, ...)`, it
-will uniquely attempt to connect to the content process hosting `theGuid`.
+identify _content processes_. Each content process will recieve a unique GUID
+when it is created, and it will register as the server for that GUID. Then, any
+window process will be able to connect to that specific content process strictly
+by GUID. Because each GUID is unique to each content process, any time any
+client calls `create_instance<>(theGuid, ...)`, it will uniquely attempt to
+connect to the content process hosting `theGuid`.
 
-This means that if we wanted to have a second WP connect to the _same_ CP as
-another window, all it needs is the CP's GUID.
+This means that if we wanted to have a second window process connect to the
+_same_ content process as another window, all it needs is the content process's
+GUID.
 
 #### Scenario: Tab Tearoff/ Reattach
 
-Oh the flow between these sections is less than stellar. I need to introduce why moving object is hard first, then talk about just moving guids
+Because all that's needed to uniquely identify an individual terminal instance
+is the GUID of the content process, it becomes fairly trivial to be able to
+"move" a terminal instance from one window to another.
+
+When a drag/drop operation happens, the payload of the event will simply contain
+the structure of the tree of panes, and the GUIDs of the content processes that
+make up the leaf nodes of the tree. The recieving window process will then be
+able to use that tree to be able to re-create a similar pane structure in its
+window, and use the GUIDs to be able to connect to the content processes for the
+terminals in that tab.
+
+The terminal buffer never needs to move from one process to another - it always
+stays in just a single content process. The only thing that changes is the
+window process which is rendering the content process's swapchain.
 
 #### Scenario: Mixed Elevation
 
 ### Monarch and Servant Processes
 
-With the current design, it's easy to connect many CPs to a WP, and move those
-CPs easily between windows. However, we want to make sure that it's also
-possible to communicate between these processes. What if we want to have only a
-single WT instance, so that whenever Wt is run, it creates a tab in the existing
-window? What if you want to run a commandline in a given WT window?
+With the current design, it's easy to connect many content processes to a window
+process, and move those content processes easily between windows. However, we
+want to make sure that it's also possible to communicate between these
+processes. What if we want to have only a single WT instance, so that whenever
+Wt is run, it creates a tab in the existing window? What if you want to run a
+commandline in a given WT window?
 
 In addition to the concept of Window and Content Processes, we'll also be
-introducing another type of categorization for WPs. These are "Monarch" and
-"Servant" processes. This will allow for the coordination across the various windows by the Monarch.
+introducing another type of categorization for window processes. These are
+"Monarch" and "Servant" processes. This will allow for the coordination across
+the various windows by the Monarch.
 
 
 
@@ -150,7 +222,11 @@ introducing another type of categorization for WPs. These are "Monarch" and
 
 #### Scenario: Single Instance Mode
 
-In Single Instance mode, the monarch _is_ the single instance. When `wt` is run, and it determines that it is not the monarch, it'll as the monarch if it's in single instance mode. If it is, then the servant that's starting up will instead pass it's commandline arguments to the monarch process, and let the monarch handle them, then exit.
+In Single Instance mode, the monarch _is_ the single instance. When `wt` is run,
+and it determines that it is not the monarch, it'll as the monarch if it's in
+single instance mode. If it is, then the servant that's starting up will instead
+pass it's commandline arguments to the monarch process, and let the monarch
+handle them, then exit.
 
 #### Scenario: Run `wt` in the current window
 
@@ -219,7 +295,7 @@ break down the work required into atomic pieces, and provide a relative ordering
 for the work described above.
 
 
-1. Add Monarch/Servant capabilities to `wt` WPs.
+1. Add Monarch/Servant capabilities to `wt` window processes.
   - This does not need to involve any actual UX functionality, simply have the
     WT instances communicate with one another to see who is the Monarch.
   - The monarch will track and assign IDs to servants.
@@ -234,12 +310,31 @@ for the work described above.
 
 * Yea add these
 
-
+* Non-terminal content in the Terminal. We've now created a _very_
+  terminal-specific IPC mechanism for transfering _terminal_ state from one
+  thread to another. When we start to plan on having panes that contain
+  non-terminal content in them, they won't be able to move across-process like
+  this. Options available here include:
+  - Passing some well-defined string / JSON blob from the source process to the
+    target process, such that the extension could use that JSON to recreate
+    whatever state the pane was last in. The extension would be able to control
+    this content entirely.
+      - Maybe they want to pass a GUID that the new process will be able to user
+        to `CreateInstance` the singleton for their UI and then dupe their swap
+        chain to the new thread...
+  - Preventing tabs with non-terminal content open in them from being dragged
+    out entirely. They're stuck in their window until the non-terminal content
+    is removed.
+* A previous discussion with someone on the input team brought up the following
+  idea: When a tab is being torn out, move all the _other_ tabs to a new window,
+  and continue dragging the current window. This should make the drag/drop
+  experience feel more seamless, and might be able to allow us to render the tab
+  content as we're drag/dropping.
 
 ## TODOs
 
 * Prove that a elevated window can host an unelevated content
-* Prove that I can toss CP IDs from one window to another
+* Prove that I can toss content process IDs from one window to another
 * come up with a commandline mechanism for starting one `wt.exe` process either
   as a window process, or as a content process
 * What about handling XAML input events? The "thin" term control will need to do
@@ -252,7 +347,8 @@ for the work described above.
 
 ## Resources
 
-* [Tab Tearout in the community toolkit]
+* [Tab Tearout in the community toolkit] - this document proved invaluable to
+  the background of tearing a tab out of an application to create a new window.
 
 <!-- Footnotes -->