Fix line skipping issue in receive_lines method #4491

Open · wants to merge 8 commits into master
Conversation

yugeeklab commented May 10, 2024

Which issue(s) this PR fixes:
Fixes #4494

What this PR does / why we need it:
Before this patch, long lines could cause breakdowns in fluentd, potentially posing a vulnerability. With this patch, max_line_size is integrated into the FIFO, enabling it to skip lines exceeding the maximum size before receive_lines is executed.
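For illustration, a minimal sketch of the intended behavior (not the actual patch; FIFO, read_lines, receive_lines, and max_line_size are the names used in this PR, while the constructor and internals are simplified):

# Minimal sketch, not the actual patch: oversized lines are dropped
# inside the FIFO, so they never reach receive_lines.
class FIFO
  def initialize(encoding, max_line_size = nil)
    @buffer = ''.force_encoding(encoding)
    @eol = "\n".encode(encoding).freeze
    @max_line_size = max_line_size
  end

  def <<(chunk)
    @buffer << chunk
  end

  # Extract complete lines from @buffer; lines exceeding max_line_size
  # are skipped here instead of being handed to the caller.
  def read_lines(lines)
    while (idx = @buffer.index(@eol))
      line = @buffer.slice!(0, idx + @eol.size)
      next if @max_line_size && line.bytesize > @max_line_size
      lines << line
    end
  end
end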

Docs Changes:

Release Note:

daipom self-requested a review May 13, 2024 02:54
daipom (Contributor) commented May 13, 2024

@yugeeklab Thanks for this fix!
CI is currently unstable because of #4487. We will fix it. Sorry for the trouble.

I see the intent of this fix as follows.

  • In the current implementation, large lines that would eventually be discarded in receive_lines are temporarily held in IOHandler's @lines.
  • This is a waste of memory.
  • This PR resolves the waste.

Surely, such a fix would allow us to limit memory consumption by the max_line_size setting to some extent!

This PR would be effective to some extent; however, I believe the memory consumption problem will remain.
It is possible that FIFO's @buffer grows without limit if the @eol never appears in the data.
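As a concrete illustration of that concern (hypothetical numbers):

# If the input never contains @eol, every chunk accumulates in @buffer
# and read_lines never extracts (or frees) anything.
buffer = ''.dup
10_000.times { buffer << ('x' * 8192) }  # one line, no "\n" anywhere
buffer.bytesize  # => 81_920_000 (~80 MB) and still growing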

Is my understanding correct?

yugeeklab marked this pull request as ready for review May 13, 2024 09:18
yugeeklab (Author) commented May 13, 2024

Hi, @daipom

I've just filed issue #4491 with more information.

This PR would be effective to some extent; however, I believe the memory consumption problem will remain.
It is possible that FIFO's @buffer grows without limit if the @eol never appears in the data.

When max_line_size isn't set, FIFO's @buffer can grow indefinitely. And even if max_line_size is set to a large value, FIFO's buffer will be bounded, but fluentd may still experience slowdowns.

Summary:

as-is: max_line_size helps avoid buffer overflow, configured via the buffer section.
to-be: max_line_size still helps prevent buffer overflow via the buffer section, and it also ensures FIFO's buffer size remains limited.
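For reference, a hypothetical in_tail configuration using max_line_size (the path, tag, and 32k value are illustrative):

<source>
  @type tail
  path /var/log/app.log
  pos_file /var/log/fluentd/app.log.pos
  tag app.logs
  max_line_size 32k
  <parse>
    @type none
  </parse>
</source>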

If you have any suggestions, such as the fifo_buffer_size parameter or any other ideas, please feel free to discuss them with me.

Thank you for your review!

daipom (Contributor) commented May 13, 2024

@yugeeklab

I've just filed issue #4491 with more information.

Thanks so much!

as-is: max_line_size helps avoid buffer overflow, configured via the buffer section. to-be: max_line_size still helps prevent buffer overflow via the buffer section, and it also ensures FIFO's buffer size remains limited.

Now I understand!
The following understanding was not correct.

This PR would be effective to some extent; however, I believe the memory consumption problem will remain.
It is possible that FIFO's @buffer grows without limit if the @eol never appears in the data.

This fix clears FIFO's @buffer in read_lines.
So, this fix ensures FIFO's buffer size remains bounded.
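A sketch of that clearing logic (names follow this PR's code; the body is illustrative rather than the exact patch):

# If no @eol has appeared yet and the partial line is already too long,
# clear @buffer now so it stays bounded, and remember (@was_long_line)
# to drop the remainder of the line once the next @eol arrives.
def read_lines(lines)
  while (idx = @buffer.index(@eol))
    line = @buffer.slice!(0, idx + @eol.size)
    if @was_long_line || (@max_line_size && line.bytesize > @max_line_size)
      @was_long_line = false     # the oversized line has now terminated
      @has_skipped_line = true
      next                       # skip it instead of emitting it
    end
    lines << line
  end
  if @max_line_size && @buffer.bytesize > @max_line_size
    @buffer.clear
    @was_long_line = true
    @has_skipped_line = true
  end
end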

If you have any suggestions, such as the fifo_buffer_size parameter or any other ideas, please feel free to discuss them with me.

Thanks!
Basically, it seems to be a very good idea to limit the FIFO's buffer.

yugeeklab (Author) commented

Basically, it seems to be a very good idea to limit the FIFO's buffer.

Thank you for your comment!! @daipom

Please let me know if there's any feedback on my code or idea. I'll review and address your feedback as soon as possible.

Thank you.

daipom (Contributor) commented May 17, 2024

Regarding the CI failures: although #4493 has been resolved, we still need to resolve #4487.

daipom (Contributor) commented May 17, 2024

@yugeeklab The CI issue has been resolved. Sorry for the trouble.
Could you please rebase this branch on the latest master?

yugeeklab force-pushed the master branch 2 times, most recently from 7082a95 to 8463d57 on May 19, 2024 08:38
yugeeklab (Author) commented

Hi, @daipom

Rebase is done!!

Thank you for your review!!

daipom (Contributor) commented May 27, 2024

Sorry for the wait.
I will review this soon.

daipom (Contributor) left a review

@yugeeklab Thanks for this fix!
This fix basically looks good to me!
I've commented on some minor details (about the following), please check!

  • Keeping the same debug log as before
  • Improving codes
  • Improving tests

Review comments (outdated, now resolved) on: lib/fluent/plugin/in_tail.rb (×3), test/plugin/test_in_tail.rb, test/plugin/in_tail/test_fifo.rb, test/plugin/in_tail/test_io_handler.rb
daipom (Contributor) commented Jun 4, 2024

@yugeeklab
Thanks for updating.
I'm checking the CI failures.

daipom (Contributor) commented Jun 4, 2024

The current CI failures have nothing to do with this PR.
Sorry for the trouble again.

daipom (Contributor) commented Jun 7, 2024

The CI issue has been resolved.
So, could you please rebase this to the latest master?
Sorry for the trouble again.

Co-authored-by: yugeeklab <yugeeklab@gmail.com>
Co-authored-by: moggaa <donionbs7@gmail.com>
Signed-off-by: been.zino <been.zino@kakaocorp.com>
yugeeklab and others added 6 commits June 9, 2024 17:16
Co-authored-by: Daijiro Fukuda <fukuda@clear-code.com>
Signed-off-by: been.zino <been.zino@kakaocorp.com>
yugeeklab (Author) commented

Hi, @daipom

Rebase is done.
I also added a commit to resolve the following issue.
Please review it if you don't mind.

Thank you.

daipom (Contributor) left a review

Thanks for the fix!
The following point is a remaining concern.

#4491 (comment)

I saw bf73efe and realized that there is a problem that needs to be solved regarding the management of pos.

We should not update pos like this commit (bf73efe).
We should only update pos at points where recovery is possible.
This means that we should not update pos until we can be sure that @lines has been successfully handled by @receive_lines.
If updating pos like this commit, some data may be lost when BufferOverflowError occurs or when Fluentd is forced to stop.
(see #4491 (comment))

So, we need to consider how to manage pos correctly for this feature.
It needs to be able to continue processing correctly even if Fluentd is forced to stop.
Repeating the process of skipping long lines would be acceptable.
Data loss or sending corrupted data would be unacceptable.
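In code terms, the requirement looks like this (a sketch of the invariant, abridged from the handler code quoted later in this thread):

unless @lines.empty?
  if @receive_lines.call(@lines, @watcher)           # hand lines downstream first
    @watcher.pe.update_pos(io.pos - @fifo.bytesize)  # only then commit the pos
    @lines.clear
  end
end
# If Fluentd is killed between reading and committing, the worst case is
# re-reading (and re-skipping) the same long line: acceptable.
# Committing pos before a successful hand-off could lose data: unacceptable.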

For this feature, we need to take care of @was_long_line in particular.
We need to make sure that the restart of Fluentd does not cause a subsequent incomplete log to be sent.

daipom (Contributor) commented Jun 12, 2024

We should not update pos like this commit (bf73efe).
...
If updating pos like this commit, some data may be lost when BufferOverflowError occurs or when Fluentd is forced to stop.

Oh, sorry, that was wrong.
The @lines.empty? condition will prevent it (probably...).

if @lines.empty? && has_skipped_line
  @watcher.pe.update_pos(io.pos - @fifo.bytesize)
end

So, we need to consider only the following points.

For this feature, we need to take care of @was_long_line in particular.
We need to make sure that the restart of Fluentd does not cause a subsequent incomplete log to be sent.

@from_encoding = from_encoding
@encoding = encoding
@need_enc = from_encoding != encoding
@buffer = ''.force_encoding(from_encoding)
@eol = "\n".encode(from_encoding).freeze
@max_line_size = max_line_size
@was_long_line = false
@has_skipped_line = false
daipom (Contributor) commented:

Is there any reason to keep @has_skipped_line as an instance variable?
If it is not necessary, we should make it a local variable in read_lines() and have read_lines() return it.
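A sketch of the suggested refactor (the skipping details follow the illustrative version above, not the exact patch):

# has_skipped_line becomes a local that read_lines returns, instead of an
# instance variable the caller has to read and reset.
def read_lines(lines)
  has_skipped_line = false
  while (idx = @buffer.index(@eol))
    line = @buffer.slice!(0, idx + @eol.size)
    if @was_long_line || (@max_line_size && line.bytesize > @max_line_size)
      @was_long_line = false
      has_skipped_line = true
      next
    end
    lines << line
  end
  if @max_line_size && @buffer.bytesize > @max_line_size
    @buffer.clear
    @was_long_line = true
    has_skipped_line = true
  end
  has_skipped_line
end

# Caller side:
has_skipped_line = @fifo.read_lines(@lines)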

daipom (Contributor) commented Jun 12, 2024

So, we need to consider only the following points.

For this feature, we need to take care of @was_long_line in particular.
We need to make sure that the restart of Fluentd does not cause a subsequent incomplete log to be sent.

I think we should change FIFO#bytesize.

def bytesize
  @buffer.bytesize
end

It is used in the following pos logic:

if @lines.empty? && has_skipped_line
  @watcher.pe.update_pos(io.pos - @fifo.bytesize)
end
unless @lines.empty?
  if @receive_lines.call(@lines, @watcher)
    @watcher.pe.update_pos(io.pos - @fifo.bytesize)
    @lines.clear
  else
    read_more = false
  end
end

def open
  io = Fluent::FileWrapper.open(@path)
  io.seek(@watcher.pe.read_pos + @fifo.bytesize)

The bytesize should be the uncommitted byte size that FIFO is still handling.
It no longer equals the size of FIFO's buffer, because FIFO can clear the buffer to skip a long line.

In the following case (max_line_size 12), the line very long line not finished yet will soon be cleared from the buffer.

short line\n                     # To be committed to the pos
very long line not finished yet  # Not to be committed to the pos until the @eol occurs

However, that data size should be considered for pos handling.
Since the line is not finished yet, the pos update should be done up to the end of short line\n.
(When Fluentd restarts, Fluentd should continue the process from the end of short line\n.)
Also, the reopening pos should be from the end of very long line not finished yet (especially for the case open_on_every_update).

For this, FIFO#bytesize should be the uncommitted byte size that FIFO is still handling, not the real buffer size of FIFO.
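A sketch of that change (@uncommitted_bytesize is a hypothetical name for illustration; the real implementation may track this differently), followed by the arithmetic for the example above:

class FIFO
  # Uncommitted bytes the FIFO is still handling. This is no longer equal to
  # @buffer.bytesize, because the bytes of a cleared (skipped) long line must
  # still be counted until the line terminates, so that io.pos - bytesize
  # points at the end of the last complete line.
  def bytesize
    @uncommitted_bytesize
  end
end

# With max_line_size 12:
#   "short line\n"                    -> 11 bytes, emitted, committable
#   "very long line not finished yet" -> 31 bytes, cleared from @buffer
# io.pos = 42 (all bytes read so far)
# committed pos : io.pos - bytesize   = 42 - 31 = 11 (end of "short line\n")
# reopen seek   : read_pos + bytesize = 11 + 31 = 42 (open_on_every_update)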

daipom (Contributor) commented Jun 12, 2024

@yugeeklab I have fixed the remaining points and pushed them to my tmp branch (the following 3 commits).
Could you please check them?
If there is no problem, I will push these commits to this PR.
If you have any concerns or ideas, please let me know.

The main point is to resolve the issue covered by the 'discards a subsequent data in a long line even if restarting occurs between' test: committing the correct pos so that processing can continue correctly.
This test would fail on the current branch.

Successfully merging this pull request may close: in_tail plugin can cause breakdowns in fluentd