< Previous by Date	Date Index	Next by Date >
< Previous in Thread	Thread Index	Next in Thread >

Re: [reSIProcate] MHS Problems.

From: Byron Campen <bcampen@xxxxxxxxxxxx>
Date: Fri, 11 Jul 2008 10:07:44 -0500

I think we should allow the max header length to be configurable.Should we restrict by default?

I believe my latest patch addresses the "efficient use of memory"problem without adding an 'assignment detection' concept. If I ammissing something, please let me know.


        Also, we could allow other limitations to be configured, like:

- Maximum number of header field values per message (maybe even on aper header type basis)

- Maximum number of parameters per header field value
- Maximum number of bytes in body

Best regards,
Byron Campen

As my message points out, there is potentially a DOS / bufferhandling error in the MHS class due to the lack of 'assignmentdetection' at present on long header field values.

I'm calling this out from the earlier thread since this (like thebody problem Byron points out) is a real issue that we shouldjointly agree on how to solve.


My suggestions:

There COULD be a max header length setting, should be settable inthe API.There SHOULD be efficient use of memory to prevent lots of wastedspace in busy systems (this favors smaller buffers).Given that we favor smaller buffers, we need to ensure that anyunused buffers are freed (unlike now).


Thoughts?

Alan

Here's the original posting that I had on this problem...
Thanks to Markus for setting the wheels in motion.

        Message-Id:     <A0720461-B4A5-4051-8379-F4C69A1A9C56@xxxxxxxxxxxx>



On Jul 10, 2008, at 10:39 , Byron Campen wrote:

Actually, this patch will cause heap corruption, since mMessagehas taken ownership of mBuffer. A better one would be:
Index: resip/stack/ConnectionBase.cxx
===================================================================
--- resip/stack/ConnectionBase.cxx      (revision 8120)
+++ resip/stack/ConnectionBase.cxx      (working copy)
@@ -176,16 +176,25 @@
          else
          {
// ...but some of the chunk must be shifted into thenext one.
-               size_t size = numUnprocessedChars*3/2;
-               if (size < ConnectionBase::ChunkSize)
+               if (numUnprocessedChars >= ConnectionBase::ChunkSize)
             {
-                  size = ConnectionBase::ChunkSize;
+ // .bwc. We have lots of unprocessed chars here;this can be+ // caused if we get a really, really largeheader field value.
+                  // We can safely reject this, I think.

I don't think you can reject this -- what if the header is verylarge? Legitimately?I think that in general we should detect that we have an unprocessednumber of bytes > chunkSize and grow chunkSize to accommodate thiscase (doubling or 3/2s at a time) as was the intent of the originalcode...

// ...but some of the chunk must be shifted into thenext one.

             size_t size = numUnprocessedChars*3/2;
             if (size < ConnectionBase::ChunkSize)
             {
                size = ConnectionBase::ChunkSize;
             }
             char* newBuffer = MsgHeaderScanner::allocateBuffer(size);

memcpy(newBuffer, unprocessedCharPtr,numUnprocessedChars);

             mBuffer = newBuffer;
             mBufferPos = numUnprocessedChars;
             mBufferSize = size;
          }

The above is my 'main' branch ConnectionBase code....

Note that the size of the buffer will grow by 3/2s the number ofunprocessed chars. In the case where the header has a value thatcontinues beyond 2 buffers in size we SHOULD see the followingbehavior:


[ a 60 char value for the 3 char header HDR ]

HDR : valuevalu1valuevalu2valuevalu3valuevalu4valuevalu5valuevalu6

With a ChunkSize of 16 (for simplicity's sake)

    0   1   2   3   4   5   6   7   8   9   A   B   C   D   E   F
  +---+---+---+---+---+---+---+---+---+---+---+---+---+---+---+---+
0000| H | D | R |   | : |   | v | a | l | u | e | v | a | l | u | 1 |
  +---+---+---+---+---+---+---+---+---+---+---+---+---+---+---+---+
0010| v | a | l | u | e | v | a | l | u | 2 | v | a | l | u | e | v |
  +---+---+---+---+---+---+---+---+---+---+---+---+---+---+---+---+
0020| a | l | u | 3 | v | a | l | u | e | v | a | l | u | 4 | v | a |
  +---+---+---+---+---+---+---+---+---+---+---+---+---+---+---+---+
0030| l | u | e | v | a | l | u | 5 | v | a | l | u | e | v | a | l |
  +---+---+---+---+---+---+---+---+---+---+---+---+---+---+---+---+
0040| u | 6 |
  +---+---+


chunk0:
    0   1   2   3   4   5   6   7   8   9
  +---+---+---+---+---+---+---+---+---+---+
0000| H | D | R |   | : |   | v | a | l | u |
  +---+---+---+---+---+---+---+---+---+---+
    ^                       ^
    `-- Unknown(l=3)        `-- unprocessedCharPtr


After the first scanner pass, the SIP message has a header pointer
(Unknown Header in this case), that points to chunk0+0x0 and has a
length of 3. Therefore, some of chunk0's contents are being used by
the message The unprocessedCharPtr has value chunk0 + 0x6 in this
case.

The code will then allocate a new buffer, necessarily copying chunk0

into the new buffer, of size unproc*3/2 (6 in this case) -- this isa bit odd since we would expect better allocation related tochunksize, not # unprocessed chars. The latter is random based onthe framing of the headers within the chunk.


chunk1 ends  up being:
    0   1   2   3   4   5
  +---+---+---+---+---+---+
0000| v | a | l | u | e | v |
  +---+---+---+---+---+---+
    ^
    `- unprocessedCharPtr

Another round (which requires that we save chunk1 gets a 9 charbuffer:


    0   1   2   3   4   5   6   7   8
  +---+---+---+---+---+---+---+---+---+
0000| v | a | l | u | e | v | a | l | u |
  +---+---+---+---+---+---+---+---+---+

Another few rounds and we have: chunk(size)
chunk2 (13), chunk3(19),
chunk4 (19), chunk5(28),
chunk6 (42), chunk7(63)

Chunk7 is finally large enough to hold the entire header:

    0   1   2   3   4   5   6   7   8   9   A   B   C   D   E   F
  +---+---+---+---+---+---+---+---+---+---+---+---+---+---+---+---+
0000| v | a | l | u | e | v | a | l | u | 1 | v | a | l | u | e | v |
  +---+---+---+---+---+---+---+---+---+---+---+---+---+---+---+---+
0010| a | l | u | 2 | v | a | l | u | e | v | a | l | u | 3 | v | a |
  +---+---+---+---+---+---+---+---+---+---+---+---+---+---+---+---+
0020| l | u | e | v | a | l | u | 4 | v | a | l | u | e | v | a | l |
  +---+---+---+---+---+---+---+---+---+---+---+---+---+---+---+---+
0030| u | 5 | v | a | l | u | e | v | a | l | u | 6 |
  +---+---+---+---+---+---+---+---+---+---+---+---+

This was a really terrible growth mechanism since it too 7 or socopies to get the data for the header in a contiguous buffer. Worse,as you point out, chunks 1 through 6 are wasted memory.


Here is what we have floating around...

chunk0:
    0   1   2   3   4   5   6   7   8   9
  +---+---+---+---+---+---+---+---+---+---+
0000| H | D | R |   | : |   | v | a | l | u |
  +---+---+---+---+---+---+---+---+---+---+
    ^                       ^
    `-- Unknown(l=3)        `-- unprocessedCharPtr


chunk1:
    0   1   2   3   4   5
  +---+---+---+---+---+---+
0000| v | a | l | u | e | v |
  +---+---+---+---+---+---+
2
    0   1   2   3   4   5   6   7   8
  +---+---+---+---+---+---+---+---+---+
0000| v | a | l | u | e | v | a | l | u |
  +---+---+---+---+---+---+---+---+---+
3
    0   1   2   3   4   5   6   7   8   9   A   B   C
  +---+---+---+---+---+---+---+---+---+---+---+---+---+
0000| v | a | l | u | e | v | a | l | u | 1 | v | a | l |
  +---+---+---+---+---+---+---+---+---+---+---+---+---+
4
    0   1   2   3   4   5   6   7   8   9   A   B   C   D   E   F
  +---+---+---+---+---+---+---+---+---+---+---+---+---+---+---+---+
0000| v | a | l | u | e | v | a | l | u | 1 | v | a | l | u | e | v |
  +---+---+---+---+---+---+---+---+---+---+---+---+---+---+---+---+
0010| a | l | u |
  +---+---+---+
5
    0   1   2   3   4   5   6   7   8   9   A   B   C   D   E   F
  +---+---+---+---+---+---+---+---+---+---+---+---+---+---+---+---+
0000| v | a | l | u | e | v | a | l | u | 1 | v | a | l | u | e | v |
  +---+---+---+---+---+---+---+---+---+---+---+---+---+---+---+---+
0010| a | l | u | 2 | v | a | l | u | e | v | a | l |
  +---+---+---+---+---+---+---+---+---+---+---+---+
6
    0   1   2   3   4   5   6   7   8   9   A   B   C   D   E   F
  +---+---+---+---+---+---+---+---+---+---+---+---+---+---+---+---+
0000| v | a | l | u | e | v | a | l | u | 1 | v | a | l | u | e | v |
  +---+---+---+---+---+---+---+---+---+---+---+---+---+---+---+---+
0010| a | l | u | 2 | v | a | l | u | e | v | a | l | u | 3 | v | a |
  +---+---+---+---+---+---+---+---+---+---+---+---+---+---+---+---+
0020| l | u | e | v | a | l | u | 4 | v | a |
  +---+---+---+---+---+---+---+---+---+---+


If you look in the code ...


      //.jacob. I've discarded the "assigned" concept.

Once upon a time there was a concept of 'assigned' on a Preparse
buffer.  If this was set when you had a fragmentation issue, you could
safely release the buffer since NOTHING in the SIP header is using the
buffer. NOTE WELL: THe first buffer is being used for the header field
name. The second through n-1 buffers are useless duplication.

I think this is a bug that was introduced with the MHS implementation
when the assigned state was taken out of the FSM. Not a big deal and
we can fix it.

Additionally, it would be a great idea for us to change the testcases for connected transports to use a rediculously small chunkSizeto validate that this all works in all cases ( I haven't checkedthat we do this yet ).


Thanks,

Alan  Hawrylyshen
Early reSIProcate Developer and Lurker
On Jul 10, 2008, at 12:24 , Alan Hawrylyshen wrote:


On Jul 10, 2008, at 10:39 , Byron Campen wrote:

Actually, this patch will cause heap corruption, since mMessagehas taken ownership of mBuffer. A better one would be:
Index: resip/stack/ConnectionBase.cxx
===================================================================
--- resip/stack/ConnectionBase.cxx      (revision 8120)
+++ resip/stack/ConnectionBase.cxx      (working copy)
@@ -176,16 +176,25 @@
          else
          {
// ...but some of the chunk must be shifted into thenext one.
-               size_t size = numUnprocessedChars*3/2;
-               if (size < ConnectionBase::ChunkSize)
+ if (numUnprocessedChars >=ConnectionBase::ChunkSize)
             {
-                  size = ConnectionBase::ChunkSize;
+ // .bwc. We have lots of unprocessed charshere; this can be+ // caused if we get a really, really largeheader field value.
+                  // We can safely reject this, I think.

I don't think you can reject this -- what if the header is verylarge? Legitimately?I think that in general we should detect that we have anunprocessed number of bytes > chunkSize and grow chunkSize toaccommodate this case (doubling or 3/2s at a time) as was theintent of the original code...

// ...but some of the chunk must be shifted into thenext one.

             size_t size = numUnprocessedChars*3/2;
             if (size < ConnectionBase::ChunkSize)
             {
                size = ConnectionBase::ChunkSize;
             }

char* newBuffer =MsgHeaderScanner::allocateBuffer(size);memcpy(newBuffer, unprocessedCharPtr,numUnprocessedChars);

             mBuffer = newBuffer;
             mBufferPos = numUnprocessedChars;
             mBufferSize = size;
          }

The above is my 'main' branch ConnectionBase code....


[ a 60 char value for the 3 char header HDR ]

HDR : valuevalu1valuevalu2valuevalu3valuevalu4valuevalu5valuevalu6

With a ChunkSize of 16 (for simplicity's sake)

    0   1   2   3   4   5   6   7   8   9   A   B   C   D   E   F
  +---+---+---+---+---+---+---+---+---+---+---+---+---+---+---+---+
0000| H | D | R |   | : |   | v | a | l | u | e | v | a | l | u | 1 |
  +---+---+---+---+---+---+---+---+---+---+---+---+---+---+---+---+
0010| v | a | l | u | e | v | a | l | u | 2 | v | a | l | u | e | v |
  +---+---+---+---+---+---+---+---+---+---+---+---+---+---+---+---+
0020| a | l | u | 3 | v | a | l | u | e | v | a | l | u | 4 | v | a |
  +---+---+---+---+---+---+---+---+---+---+---+---+---+---+---+---+
0030| l | u | e | v | a | l | u | 5 | v | a | l | u | e | v | a | l |
  +---+---+---+---+---+---+---+---+---+---+---+---+---+---+---+---+
0040| u | 6 |
  +---+---+


chunk0:
    0   1   2   3   4   5   6   7   8   9
  +---+---+---+---+---+---+---+---+---+---+
0000| H | D | R |   | : |   | v | a | l | u |
  +---+---+---+---+---+---+---+---+---+---+
    ^                       ^
    `-- Unknown(l=3)        `-- unprocessedCharPtr


After the first scanner pass, the SIP message has a header pointer
(Unknown Header in this case), that points to chunk0+0x0 and has a
length of 3. Therefore, some of chunk0's contents are being used by
the message The unprocessedCharPtr has value chunk0 + 0x6 in this
case.

The code will then allocate a new buffer, necessarily copying chunk0


chunk1 ends  up being:
    0   1   2   3   4   5
  +---+---+---+---+---+---+
0000| v | a | l | u | e | v |
  +---+---+---+---+---+---+
    ^
    `- unprocessedCharPtr

Another round (which requires that we save chunk1 gets a 9 charbuffer:


    0   1   2   3   4   5   6   7   8
  +---+---+---+---+---+---+---+---+---+
0000| v | a | l | u | e | v | a | l | u |
  +---+---+---+---+---+---+---+---+---+

Another few rounds and we have: chunk(size)
chunk2 (13), chunk3(19),
chunk4 (19), chunk5(28),
chunk6 (42), chunk7(63)

Chunk7 is finally large enough to hold the entire header:

    0   1   2   3   4   5   6   7   8   9   A   B   C   D   E   F
  +---+---+---+---+---+---+---+---+---+---+---+---+---+---+---+---+
0000| v | a | l | u | e | v | a | l | u | 1 | v | a | l | u | e | v |
  +---+---+---+---+---+---+---+---+---+---+---+---+---+---+---+---+
0010| a | l | u | 2 | v | a | l | u | e | v | a | l | u | 3 | v | a |
  +---+---+---+---+---+---+---+---+---+---+---+---+---+---+---+---+
0020| l | u | e | v | a | l | u | 4 | v | a | l | u | e | v | a | l |
  +---+---+---+---+---+---+---+---+---+---+---+---+---+---+---+---+
0030| u | 5 | v | a | l | u | e | v | a | l | u | 6 |
  +---+---+---+---+---+---+---+---+---+---+---+---+

This was a really terrible growth mechanism since it too 7 or socopies to get the data for the header in a contiguous buffer.Worse, as you point out, chunks 1 through 6 are wasted memory.


Here is what we have floating around...

chunk0:
    0   1   2   3   4   5   6   7   8   9
  +---+---+---+---+---+---+---+---+---+---+
0000| H | D | R |   | : |   | v | a | l | u |
  +---+---+---+---+---+---+---+---+---+---+
    ^                       ^
    `-- Unknown(l=3)        `-- unprocessedCharPtr


chunk1:
    0   1   2   3   4   5
  +---+---+---+---+---+---+
0000| v | a | l | u | e | v |
  +---+---+---+---+---+---+
2
    0   1   2   3   4   5   6   7   8
  +---+---+---+---+---+---+---+---+---+
0000| v | a | l | u | e | v | a | l | u |
  +---+---+---+---+---+---+---+---+---+
3
    0   1   2   3   4   5   6   7   8   9   A   B   C
  +---+---+---+---+---+---+---+---+---+---+---+---+---+
0000| v | a | l | u | e | v | a | l | u | 1 | v | a | l |
  +---+---+---+---+---+---+---+---+---+---+---+---+---+
4
    0   1   2   3   4   5   6   7   8   9   A   B   C   D   E   F
  +---+---+---+---+---+---+---+---+---+---+---+---+---+---+---+---+
0000| v | a | l | u | e | v | a | l | u | 1 | v | a | l | u | e | v |
  +---+---+---+---+---+---+---+---+---+---+---+---+---+---+---+---+
0010| a | l | u |
  +---+---+---+
5
    0   1   2   3   4   5   6   7   8   9   A   B   C   D   E   F
  +---+---+---+---+---+---+---+---+---+---+---+---+---+---+---+---+
0000| v | a | l | u | e | v | a | l | u | 1 | v | a | l | u | e | v |
  +---+---+---+---+---+---+---+---+---+---+---+---+---+---+---+---+
0010| a | l | u | 2 | v | a | l | u | e | v | a | l |
  +---+---+---+---+---+---+---+---+---+---+---+---+
6
    0   1   2   3   4   5   6   7   8   9   A   B   C   D   E   F
  +---+---+---+---+---+---+---+---+---+---+---+---+---+---+---+---+
0000| v | a | l | u | e | v | a | l | u | 1 | v | a | l | u | e | v |
  +---+---+---+---+---+---+---+---+---+---+---+---+---+---+---+---+
0010| a | l | u | 2 | v | a | l | u | e | v | a | l | u | 3 | v | a |
  +---+---+---+---+---+---+---+---+---+---+---+---+---+---+---+---+
0020| l | u | e | v | a | l | u | 4 | v | a |
  +---+---+---+---+---+---+---+---+---+---+


If you look in the code ...


      //.jacob. I've discarded the "assigned" concept.

Once upon a time there was a concept of 'assigned' on a Preparse

buffer. If this was set when you had a fragmentation issue, youcouldsafely release the buffer since NOTHING in the SIP header is usingthebuffer. NOTE WELL: THe first buffer is being used for the headerfield

name. The second through n-1 buffers are useless duplication.

I think this is a bug that was introduced with the MHS implementation
when the assigned state was taken out of the FSM. Not a big deal and
we can fix it.

Additionally, it would be a great idea for us to change the testcases for connected transports to use a rediculously smallchunkSize to validate that this all works in all cases ( I haven'tchecked that we do this yet ).


Thanks,

Alan  Hawrylyshen
Early reSIProcate Developer and Lurker

Attachment: smime.p7s
Description: S/MIME cryptographic signature

References:
- [reSIProcate] resiprocate crash due to no free memory when receiving sip message with large header
  - From: Markus Eriksson
- Re: [reSIProcate] resiprocate crash due to no free memory when receiving sip message with large header
  - From: Byron Campen
- Re: [reSIProcate] resiprocate crash due to no free memory when receiving sip message with large header
  - From: Markus Eriksson
- Re: [reSIProcate] resiprocate crash due to no free memory when receiving sip message with large header
  - From: Byron Campen
- Re: [reSIProcate] resiprocate crash due to no free memory when receiving sip message with large header
  - From: Byron Campen
- Re: [reSIProcate] resiprocate crash due to no free memory when receiving sip message with large header
  - From: Alan Hawrylyshen

Prev by Date: Re: [reSIProcate] resiprocate crash due to no free memory when receiving sip message with large header
Next by Date: Re: [reSIProcate] resiprocate crash due to no free memory when receiving sip message with large header
Previous by thread: Re: [reSIProcate] resiprocate crash due to no free memory when receiving sip message with large header
Next by thread: Re: [reSIProcate] resiprocate crash due to no free memory when receiving sip message with large header
Index(es):
- Date
- Thread