FreeBSD/src 6e76489sys/netinet tcp_pcap.c tcp_pcap.h

tcp: remove support for TCPPCAP

This feature could be used to store the last sent and received TCP
packets for a TCP endpoint. There was no utility to get these packets
from a live system or core.
This functionality is now provided by TCP Black Box Logging, which also
stores additional events. There are tools to get these traces from a
live system or a core.
Therefore remove TCPPCAP to avoid maintaining it, when it is not
used anymore.

Reviewed by:            rrs, rscheff, Peter Lei, glebiu
Sponsored by:           Netflix, Inc.
Differential Revision:  https://reviews.freebsd.org/D49589
DeltaFile
+0-452sys/netinet/tcp_pcap.c
+0-39sys/netinet/tcp_pcap.h
+0-33sys/netinet/tcp_usrreq.c
+0-24sys/netinet/tcp_subr.c
+0-12sys/netinet/tcp_output.c
+0-7sys/netinet/tcp_input.c
+0-5678 files not shown
+5-58314 files

FreeBSD/src 1043b36sys/netinet tcp_output.c

tcp: don't send beyond receivers advertised window

When calculating length of data to be sent, we may do some congestion
calculations, but we shall never send a byte beyond (snd_una + snd_wnd).
In 7dc78150c730e we started to use tcp_compute_pipe() instead of (snd_wnd
- off).  This function makes an estimate of how much data is in flight. It
can return a value smaller and larger than (snd_nxt - snd_una).  It will
return a value larger when we have retransmitted some data from SACK
holes, and smaller once those retransmitted SACK holes are acked.

We may use tcp_compute_pipe() for length calculation, but always capped
by the send offset 'off', which (snd_nxt - snd_una).

PR:                     283649
Reviewed by:            rscheff
Differential Revision:  https://reviews.freebsd.org/D48237
Fixes:                  7dc78150c730e90fa2afdaba3aa645932b30c429
DeltaFile
+3-2sys/netinet/tcp_output.c
+3-21 files

FreeBSD/src 72c11c4sys/netinet tcp_output.c

tcp: make sack_rxmit in tcp_output() a bool

Also make this variable initialization, as well as accompanying sackhole
pointer, slightly more readable. NFC.

Reviewed by:            rscheff, tuexen, rrs
Differential Revision:  https://reviews.freebsd.org/D48235
DeltaFile
+14-11sys/netinet/tcp_output.c
+14-111 files

FreeBSD/src 1a264ffsys/netinet tcp_output.c

tcp_output: Clear FIN if tcp_m_copym truncates output length

Reviewed by:    rscheff, tuexen, gallatin
Sponsored by:   Chelsio Communications
Differential Revision:  https://reviews.freebsd.org/D46824

(cherry picked from commit 519981e3c09c2875b136635459c87ab24d3104e2)
DeltaFile
+5-0sys/netinet/tcp_output.c
+5-01 files

FreeBSD/src 05d81fesys/netinet tcp_output.c

tcp_output: Clear FIN if tcp_m_copym truncates output length

Reviewed by:    rscheff, tuexen, gallatin
Sponsored by:   Chelsio Communications
Differential Revision:  https://reviews.freebsd.org/D46824

(cherry picked from commit 519981e3c09c2875b136635459c87ab24d3104e2)
DeltaFile
+5-0sys/netinet/tcp_output.c
+5-01 files

FreeBSD/src 0fc7bdcsys/dev/irdma irdma_cm.c, sys/netgraph/netflow netflow.c

tcp: extend the use of the th_flags accessor function

Formally, there are 12 bits for TCP header flags.
Use the accessor functions in more (kernel) places.

No functional change.

Reviewed By: cc, #transport, cy, glebius, #iflib, kbowling
Sponsored by: NetApp, Inc.
Differential Revision: https://reviews.freebsd.org/D47063
DeltaFile
+40-40sys/netpfil/pf/pf.c
+16-16sys/dev/irdma/irdma_cm.c
+16-11sys/netpfil/ipfw/ip_fw2.c
+11-11sys/netpfil/ipfilter/netinet/ip_state.c
+8-8usr.sbin/ppp/slcompress.c
+8-8sys/netgraph/netflow/netflow.c
+99-9437 files not shown
+193-18743 files

FreeBSD/src 22dcc81sys/netinet/cc cc_cubic.c cc_chd.c

tcp: Use segment size excluding tcp options for all cwnd calculations

Avoid sending small segments by making sure that cwnd is usually
calculated in full (data) segment sizes. Especially during loss
recovery and retransmission scenarios.

Reviewed By: tuexen, #transport
Sponsored by: NetApp, Inc.
Differential Revision: https://reviews.freebsd.org/D47474
DeltaFile
+12-12sys/netinet/cc/cc_cubic.c
+11-9sys/netinet/cc/cc_chd.c
+8-7sys/netinet/cc/cc_htcp.c
+7-6sys/netinet/cc/cc.c
+7-6sys/netinet/cc/cc_dctcp.c
+6-5sys/netinet/cc/cc_vegas.c
+51-453 files not shown
+61-549 files

FreeBSD/src 8f5a2e2sys/netinet tcp_output.c tcp_input.c

tcp: fix cwnd recalculation during limited transmit

Properly calculate the expected flight size (cwnd) during
limited transmit. Exclude the SACK scoreboard from
consideration when still in limited transmit.

PR: 282605
Reviewed By: tuexen, #transport
Sponsored by: NetApp, Inc.
Differential Revision: https://reviews.freebsd.org/D47541
DeltaFile
+2-1sys/netinet/tcp_output.c
+1-1sys/netinet/tcp_input.c
+3-22 files

FreeBSD/src c9047ebsys/netinet tcp_output.c, sys/netinet/tcp_stacks rack.c bbr.c

tcp: allow TSO even while RX path is unordered

Over IP networks, forward and return path largely
act independently from each other. Do not disable LRO
on the TX side, when reordering/loss is happening
on the RX half-connection.

Reviewed By: rrs, #transport, peter.lei_ieee.org
Sponsored by: NetApp, Inc.
Differential Revision: https://reviews.freebsd.org/D47056
DeltaFile
+4-4sys/netinet/tcp_output.c
+1-4sys/netinet/tcp_stacks/rack.c
+0-1sys/netinet/tcp_stacks/bbr.c
+5-93 files

FreeBSD/src dded4e9sys/netinet tcp_input.c tcp_output.c

tcp: change SOCKBUF_* macros to SOCK_[RECV|SEND]BUF_* macros

Change the older LOCK related macros over to the
dedicated send/recv buffer macros in the base tcp stack.

No functional change intended.

Reviewed By: tuexen, #transport
Sponsored by: NetApp, Inc.
Differential Revision: https://reviews.freebsd.org/D47567
DeltaFile
+14-14sys/netinet/tcp_input.c
+11-11sys/netinet/tcp_output.c
+5-5sys/netinet/tcp_usrreq.c
+2-2sys/netinet/tcp_reass.c
+2-2sys/netinet/tcp_subr.c
+34-345 files

FreeBSD/src 7dc7815sys/netinet tcp_input.c tcp_output.c

tcp: refactor cwnd during SACK transmissions to allow TSO

Refactoring of cwnd and moving the adjustment for SACKed data into
tcp_output() - cwnd tracking the maximum extent starting at snd_una -
allows both SACK loss recovery as well as SACK transmissions after
RTO during slow start and if allowed, the use of TSO while in loss
recovery.

Reviewed By:            tuexen, cc, #transport
Sponsored by:           NetApp, Inc.
Differential Revision:  https://reviews.freebsd.org/D43470
DeltaFile
+52-33sys/netinet/tcp_input.c
+37-32sys/netinet/tcp_output.c
+8-9sys/netinet/tcp_sack.c
+97-743 files

FreeBSD/src 6b2977csys/netinet tcp_sack.c tcp_output.c

tcp: fix duplicate retransmissions when RTO happens during SACK loss recovery

When snd_nxt doesn't track snd_max, partial SACK ACKs may elicit
unexpected duplicate retransmissions. This is usually masked by
LRO not necessarily ACKing every individual segment, and prior
to RFC6675 SACK loss recovery, harder to trigger even when an
RTO happens while SACK loss recovery is ongoing.

Address this by improving the logic when to start a SACK loss recovery
and how to deal with a RTO, as well as improvements to the adjusted
congestion window during transmission selection.

Reviewed By:    tuexen, cc, #transport
Sponsored by:   NetApp, Inc.
MFC after:      1 week
Differential Revision:  https://reviews.freebsd.org/D43355

(cherry picked from commit 440f4ba18e3ab7be912858bbcb96a419fcf14809)
DeltaFile
+27-15sys/netinet/tcp_sack.c
+17-10sys/netinet/tcp_output.c
+10-6sys/netinet/tcp_input.c
+1-1sys/netinet/tcp_var.h
+55-324 files

FreeBSD/src 440f4basys/netinet tcp_output.c tcp_sack.c

tcp: fix duplicate retransmissions when RTO happens during SACK loss recovery

When snd_nxt doesn't track snd_max, partial SACK ACKs may elicit
unexpected duplicate retransmissions. This is usually masked by
LRO not necessarily ACKing every individual segment, and prior
to RFC6675 SACK loss recovery, harder to trigger even when an
RTO happens while SACK loss recovery is ongoing.

Address this by improving the logic when to start a SACK loss recovery
and how to deal with a RTO, as well as improvements to the adjusted
congestion window during transmission selection.

Reviewed By:    tuexen, cc, #transport
Sponsored by:   NetApp, Inc.
MFC after:      1 week
Differential Revision:  https://reviews.freebsd.org/D43355
DeltaFile
+16-9sys/netinet/tcp_output.c
+14-8sys/netinet/tcp_sack.c
+10-8sys/netinet/tcp_input.c
+1-1sys/netinet/tcp_var.h
+41-264 files

FreeBSD/src 519981esys/netinet tcp_output.c

tcp_output: Clear FIN if tcp_m_copym truncates output length

Reviewed by:    rscheff, tuexen, gallatin
Sponsored by:   Chelsio Communications
Differential Revision:  https://reviews.freebsd.org/D46824
DeltaFile
+5-0sys/netinet/tcp_output.c
+5-01 files

FreeBSD/src a00c3a9sys/netinet tcp_output.c tcp.h

tcp: remove remnants of 20+ year old disabled code from d912c694ee00

Fixes:  90ad2dc28747a3a5036d68ccc87d2352459f5cb1
DeltaFile
+0-3sys/netinet/tcp_output.c
+0-2sys/netinet/tcp.h
+0-52 files

FreeBSD/src 2290185sys/netinet tcp_output.c

tcp: no data on SYN segments unless doing TFO

Ensure that there is no data on SYN segments unless doing TFO.
This check is already in RACK and BBR.

Reported by:            glebius
Reviewed by:            rscheff
Sponsored by:           Netflix, Inc.
Differential Revision:  https://reviews.freebsd.org/D44384

(cherry picked from commit af700f430fd86ba3eae63e587985a12436db8f69)
DeltaFile
+6-0sys/netinet/tcp_output.c
+6-01 files

FreeBSD/src b691974sys/netinet tcp_output.c tcp_subr.c

ipsec_offload: handle TSO if supported

Allow for TSO to operate if network interface supports ipsec inline
offload and supports TSO over it.

Reviewed by:    tuexen
Sponsored by:   NVIDIA networking
Differential revision:  https://reviews.freebsd.org/D44222
DeltaFile
+9-11sys/netinet/tcp_output.c
+4-0sys/netinet/tcp_subr.c
+2-0sys/netinet/tcp_var.h
+2-0sys/netinet/tcp_input.c
+17-114 files

FreeBSD/src 2a9aae9sys/netinet tcp_output.c tcp_var.h, usr.bin/netstat inet.c

tcp: add counter to track when SACK loss recovery uses TSO

Add a counter to track how frequently SACK has transmitted
more than one MSS using TSO. Instances when this will be
beneficial is the use of PRR, or when ACK thinning due to
GRO/LRO or ACK discards by the network are present.

Reviewed By:            tuexen, #transport
Sponsored by:           NetApp, Inc.
Differential Revision:  https://reviews.freebsd.org/D45070
DeltaFile
+5-3usr.bin/netstat/inet.c
+3-0sys/netinet/tcp_output.c
+2-1sys/netinet/tcp_var.h
+1-0sys/netinet/in_kdtrace.c
+1-0sys/netinet/in_kdtrace.h
+12-45 files

FreeBSD/src dcdfe44sys/netinet tcp_sack.c tcp_var.h

tcp: add sysctl to allow/disallow TSO during SACK loss recovery

Introduce net.inet.tcp.sack.tso for future use when TSO is ready
to be used during loss recovery.

Reviewed By:            tuexen, #transport
Sponsored by:           NetApp, Inc.
Differential Revision:  https://reviews.freebsd.org/D45068
DeltaFile
+5-0sys/netinet/tcp_sack.c
+2-0sys/netinet/tcp_var.h
+1-1sys/netinet/tcp_output.c
+8-13 files

FreeBSD/src fd2a580sys/netinet tcp_output.c

tcp: no data on SYN segments unless doing TFO

Ensure that there is no data on SYN segments unless doing TFO.
This check is already in RACK and BBR.

Reported by:            glebius
Reviewed by:            rscheff
Sponsored by:           Netflix, Inc.
Differential Revision:  https://reviews.freebsd.org/D44384

(cherry picked from commit af700f430fd86ba3eae63e587985a12436db8f69)
DeltaFile
+6-0sys/netinet/tcp_output.c
+6-01 files

FreeBSD/src af700f4sys/netinet tcp_output.c

tcp: no data on SYN segments unless doing TFO

Ensure that there is no data on SYN segments unless doing TFO.
This check is already in RACK and BBR.

Reported by:            glebius
Reviewed by:            rscheff
MFC after:              1 week
Sponsored by:           Netflix, Inc.
Differential Revision:  https://reviews.freebsd.org/D44384
DeltaFile
+6-0sys/netinet/tcp_output.c
+6-01 files

FreeBSD/src dd7b86esys/netinet tcp_usrreq.c tcp_input.c, sys/netinet/tcp_stacks rack.c bbr.c

tcp: remove IS_FASTOPEN() macro

The macro is more obfuscating than helping as it just checks a single flag
of t_flags.  All other t_flags bits are checked without a macro.

A bigger problem was that declaration of the macro in tcp_var.h depended
on a kernel option.  It is a bad practice to create such definitions in
installable headers.

Reviewed by:            rscheff, tuexen, kib
Differential Revision:  https://reviews.freebsd.org/D44362
DeltaFile
+18-17sys/netinet/tcp_stacks/rack.c
+13-13sys/netinet/tcp_stacks/bbr.c
+7-8sys/netinet/tcp_usrreq.c
+7-7sys/netinet/tcp_input.c
+7-7sys/netinet/tcp_output.c
+0-8sys/netinet/tcp_var.h
+52-602 files not shown
+54-628 files

FreeBSD/src a8e817csys/netinet tcp_output.c

tcp: stop doing superfluous work after sending RST

When sending a RST control segment in tcp_output() it
means we are in TCPS_CLOSED state, called from tcp_drop().
Once the RST is sent, don't call tcp_timer_activate() or
update anything in tcpcb, since that will go away shortly.

PR:                     276761
Provided by:            glebius
Reviewed By:            glebius, tuexen, #transport
Sponsored by:           NetApp, Inc.
Differential Revision:  https://reviews.freebsd.org/D43808
DeltaFile
+6-2sys/netinet/tcp_output.c
+6-21 files

FreeBSD/src 2d05a1csys/netinet tcp_input.c tcp_sack.c

tcp: commonize check for more data to send, style changes

Use SEQ_SUB instead of a plain subtraction, for an implict
type conversion and prevention of a possible overflow.
Use curly brackets in stacked if statements throughout.
Use of the ? operator to enhance readability when clearing
the FIN flag in tcp_output().

None of the above change the function.

Reviewed By:           tuexen, cc, #transport
Sponsored by:          NetApp, Inc.
Differential Revision: https://reviews.freebsd.org/D43539
DeltaFile
+55-39sys/netinet/tcp_input.c
+19-12sys/netinet/tcp_sack.c
+4-8sys/netinet/tcp_output.c
+5-4sys/netinet/tcp_ecn.c
+83-634 files

FreeBSD/src 0932fb5sys/netinet tcp_output.c

tcp: fix TCPSTAT accounting for SACK

Account for SACK retransmitted bytes once the actual length
is known. This prevents a call to tcp_maxseg() and prepares
for TSO support when transmitting from the SACK scoreboard.

Reviewed By:           tuexen, #transport
Sponsored by:          NetApp, Inc.
Differential Revision: https://reviews.freebsd.org/D43447
DeltaFile
+4-3sys/netinet/tcp_output.c
+4-31 files

FreeBSD/src 90ad2dcsys/netinet tcp_output.c

tcp: remove 20+ year old disabled code from d912c694ee00
DeltaFile
+0-10sys/netinet/tcp_output.c
+0-101 files

FreeBSD/src c809435sys/netinet tcp_output.c

tcp: clear outdated comment mentioning T/TCP
DeltaFile
+1-2sys/netinet/tcp_output.c
+1-21 files

FreeBSD/src b1f4229sys/kern uipc_mbuf.c, sys/netinet tcp_pcap.c tcp_output.c

sys: Style fix for M_EXT | M_EXTPG

Add a space around the | operator in places testing for either M_EXT
or M_EXTPG.

Reviewed by:    imp, glebius
Sponsored by:   Chelsio Communications
Differential Revision:  https://reviews.freebsd.org/D43216

(cherry picked from commit f7d5900aa063c6e9d19e6372ba981fc76aa326a2)
DeltaFile
+9-9sys/kern/uipc_mbuf.c
+1-1sys/netinet/tcp_pcap.c
+1-1sys/netinet/tcp_output.c
+1-1sys/netinet/tcp_stacks/rack.c
+12-124 files

FreeBSD/src e000bd2sys/kern uipc_mbuf.c, sys/netinet tcp_output.c tcp_pcap.c

sys: Style fix for M_EXT | M_EXTPG

Add a space around the | operator in places testing for either M_EXT
or M_EXTPG.

Reviewed by:    imp, glebius
Sponsored by:   Chelsio Communications
Differential Revision:  https://reviews.freebsd.org/D43216

(cherry picked from commit f7d5900aa063c6e9d19e6372ba981fc76aa326a2)
DeltaFile
+9-9sys/kern/uipc_mbuf.c
+1-1sys/netinet/tcp_output.c
+1-1sys/netinet/tcp_pcap.c
+1-1sys/netinet/tcp_stacks/rack.c
+12-124 files

FreeBSD/src 8923d34sys/netinet tcp_output.c tcp_input.c

tcp: clean PRR state after ECN congestion recovery.

PRR state was not properly reset on subsequent ECN CE
events. Clean up after local transmission failures too.

Reviewed by:           tuexen, cc, #transport
MFC after:             3 days
Sponsored by:          NetApp, Inc.
Differential Revision: https://reviews.freebsd.org/D43170

(cherry picked from commit 429f14f83ae1f6357a9455d4f075522420b01b03)
DeltaFile
+2-0sys/netinet/tcp_output.c
+1-0sys/netinet/tcp_input.c
+3-02 files