docs: opengl. Add some great documentation provided by Roudolf.

* Posted to the haiku-appserver ML, circa 2005. * Covers a lot of intended uses of Be's accelerant API's in depth and compares them to moden cards
2014-08-19 10:28:53 -04:00 · 2014-08-19 10:28:53 -04:00 · 4662c8c0a9
commit 4662c8c0a9
parent f66aa8595c
1 changed files with 136 additions and 0 deletions
--- a/docs/develop/opengl/accelerant-interfaces
+++ b/docs/develop/opengl/accelerant-interfaces
@ -0,0 +1,136 @@
+Posted by Rudolf Nov 22, 2005 to the haiku-appsever ML:
+http://www.freelists.org/post/haiku-appserver/new-drawing-bug-Rudolf-some-stuff-for-you,10
+
+More good info @ http://www.freelists.org/archive/haiku-appserver
+
+        Be Docs (file: R4_Graphics_Driver_Docs):
+        
+        Engine Synchronization
+
+        B_ACCELERANT_ENGINE_COUNT  -  No feature specific data required.
+        Return the number of acceleration engines that the device may operate in parallel.
+        It's not required for all engines to be equally capable (i.e. support the same
+        acceleration features).
+
+        B_ACQUIRE_ENGINE  -  No feature specific data required.
+        Request exclusive ownership of an acceleration engine with the capabilities mask specified.
+        The caller is willing to wait up to max_wait micro(milli?)-seconds. If the request can't be
+        fullfilled before that time expires, the accelerant should return B_WOULD_BLOCK immediatly.
+        If non-zero, sync_token points to a synchronization token retrieved from either
+        release_engine() or get_sync_token() to which the engine should be synchronized before
+        engine acquisition succeeds.  See B_SYNC_TO_TOKEN for more details.  The engine_token for
+        the successfully acquired engine is returned in engine_token **et.
+
+        B_RELEASE_ENGINE  -  No feature specific data required.
+        Relinquish exclusive ownership of the engine specified by engine_token et.  If
+        sync_token *st is non-zero, return a sync_token which can be utilized to ensure that the
+        specified engine has completed all acceleration operations issued up this point in time.
+
+        B_WAIT_ENGINE_IDLE  -  No feature specific data required.
+        Wait for the graphics device to be completely idle (i.e. no current running or pending
+        acceleration primitives, DMA transfers, etc.).
+
+        B_GET_SYNC_TOKEN  -  No feature specific data required.
+        Return a synchronization token for the specified engine.  sync_token *st must point at
+        a sync_token which will be updated with the information required to ensure that a call to
+        sync_to_token() will be able to ensure that the specified engine has completed all of the
+        acceleration primitives, DMA transfers, etc. that have been issued up to this point in time.
+
+        B_SYNC_TO_TOKEN  -  No feature specific data required.
+        Ensure that the engine specified in sync_token *st has completed the acceleration
+        primitives, DMA transfers, etc., issued up to the point in time specified in the sync_token.
+
+
+        Rudolf's notes.        
+        info on engine_token:
+
+        uint32 ACCELERANT_ENGINE_COUNT(void)
+        exists because in theory a single card can have multiple independant acceleration engines.
+        For instance one that can do 2D, and one that can do 3D (or combinations). See
+        'engine capabilities' flags in Accelerant.h.
+        (note: never seen multiple engines per card     yet though. Was this meant for some very old
+        hardware with seperate 2D and 3D 'blocks'? Although I guess it's thinkable that multiple
+        equally capable engines would exist as well..)
+
+        In order to distinquish between multiple engines all acceleration commands are given along
+        with an engine_token *et, aquired when
+        status_t ACQUIRE_ENGINE(uint32 capabilities, uint32 max_wait, sync_token *st, engine_token **et)
+        was called.
+        status_t SYNC_TO_TOKEN(sync_token *st)
+        is an exeption: here the engine is ID'd by member: (defined in Accelerant.h)
+        uint32  engine_id.
+        void WAIT_ENGINE_IDLE(void)
+        is an exeption also because this function returns only when *ALL* engines are completely
+        idle. Hence no need for distinction.
+        
+        Furthermore the absense of the use of engine_token in both SYNC_TO_TOKEN and WAIT_ENGINE_IDLE
+        would seem to indicate these hooks may be used *without having acquired the engine*. (?).
+
+
+        info on sync_token:
+
+--      How does the acceleration cmd interface work? There's a circular buffer that stores cmd's.
+        There's a hardware pointer that points 'at' the command currently being executed (think
+        of stacks: some architectures point to the first 'free' location, some to the 'last used'
+        location.)
+        There's also a second pointer which points at the first free location, i.e. where new cmd's
+        will be stored pending execution. This second pointer is a software maintained pointer (in
+        the driver).
+
+--      What's a sync_token? A Sync_token in nothing more than a extra pointer (one per token). This
+        pointer points at the first free location in the cmd buffer at that point in time the 
+        sync_token was 'filled'. In theory it doesn't change during the (rest of the) life-time of
+        the token, although the driver-implementation could (?) do that anyway for some internal
+        reason.
+        Driver users (i.e. app_server) are responsible for reserving memory for a sync_token. They
+        pass a pointer to it to the driver (if they want to use it). The driver in turns fills it
+        with the needed info. Driver users may never modify the content of the sync_token(s).
+        
+--      When would a user be interested in a sync_token? If multiple independant (so non-overlapping)
+        'regions' require multiple updates each alternatingly done by software and acceleration engine,
+        it might be interesting to use sync_tokens (for instance).
+        It would be possible to issue all engine commands concerning area #1, ask for a sync token,
+        issue all engine commands concerning area #2, and then do this:
+        - SYNC_TO_TOKEN (so waiting until all engine commands concerning area #1 are done);
+        - Draw in area #1 using software (while the acc engine is in the process of updating area #2:
+         (so some 'parallel processing' is done here).
+
+        If no sync token would be used, instead of syncing to token, wait_engine_idle would be used.
+        This of course would mean that no 'parallel processing' could be done...
+        Worse yet: if some seperate user is also using the engine (between RELEASE and AQUIRE engine
+        done by 'user #1' (i.e. app_server), the acc engine might *never* become fully idle.
+        A nice example here would be 3D acceleration: as long as no non-accelerated drawing has to be
+        done there, there's no reason the engine would need to be idle at all for this (apart from
+        processing 'user-input' like joystick controls).
+        You see: an acceleration engine automatically serializes it's cmd processing so update errors
+        because of out-of-order execution wouldn't happen.
+        
+--      Why didn't I implement sync_token stuff in the matrox, nvidia and neomagic drivers?
+        Lack of 'known specs' yet. You see, even if the pointers in the cmd buffer are known: that's 
+        not enough yet:
+        - The fact that an command is fetched from the buffer does *not* mean it's
+        execution is completely done.
+        - The implementation via just a pointer is not enough: if you wait too long, the current
+        free pointer might have cycled around the buffer. This means a simple compare to that
+        pointer is not conclusive. Much better would be actually inserting 'dummy commands' inside
+        the command buffer at the place a sync_token is generated. This dummy commands would be 
+        executed just once, when we reached our goal. If the dummy command clears a variable 
+        especially setup for the sync_token in question: we would have a conclusive result without
+        even the use of an actual pointer in the sync_token. AND: updating the tokens would have no
+        software overhead, as the acc engine would do it. (via 'pointers only' some invalidation
+        code has to be executed once the hardware pointer cycles around).
+
+        NOTE PLEASE:
+        This information is based upon my understanding of hardware inner workings. As this 
+        understanding grows and is corrected over time, I might contradict myself later on if
+        asked again. :-)         
+
+
+//actual hooks:
+        uint32 ACCELERANT_ENGINE_COUNT(void);
+        status_t ACQUIRE_ENGINE(uint32 capabilities, uint32 max_wait, 
+sync_token *st, engine_token **et);
+        status_t RELEASE_ENGINE(engine_token *et, sync_token *st);
+        status_t GET_SYNC_TOKEN(engine_token *et, sync_token *st);
+        status_t SYNC_TO_TOKEN(sync_token *st);
+        void WAIT_ENGINE_IDLE(void);