I think the compositing happens in the Zoom client now.  You can
switch between a mode that shows a mosaic of people and one that has a
big picture of the speaker and a mosaic of other people, with the
mosaics all scrollable in large conferences.  The switching seems
instant to me, which suggests that the client is doing it.  The
background substitution definitely happens in the client, since they
disable it if your computer isn't fast enough.

It's pretty clear that Zoom thought of their market as B2B, where the
image and other processing are managed by contract rather than by
technology.  Now they're finding that a lot of their new users, and
some of their surprised old users, don't like that.

