Commit f43d94c5 authored by Jan David Mol

Task #11059: Data loss: Added impact of payload, and hint to check VLAN IPs

parent 3a157b25
@@ -22,9 +22,15 @@ Total input loss occurs when:
||_|\_ 1-digit board number (0..3, and 6..9 for HBA1)
| \___ 3-digit station number
\_____ fixed prefix
* As root on COBALT, run "tcpdump -i <interface> udp -c 100", and check if the packets are received and correctly addressed.
* For international stations, the receiving COBALT node needs to have the right VLANs configured. If not, the packets will
arrive on eth5 (cbt00x-10GB04), but are dropped because the destination IP (belonging to the VLAN) does not exist. If you see
packets arriving for VLAN IPs, check with "ip addr" which IPs exist.
* The network drops the datagrams due to routing issues. Trace the station route through the network:
https://www.astron.nl/lofarwiki/doku.php?id=wanarea:start
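The VLAN check described above (compare the destination IPs seen in tcpdump with the IPs listed by "ip addr") can be sketched as a small script. This is a minimal illustration, not part of COBALT: the function names, the sample interface names and the sample addresses are all hypothetical, and it assumes the one-line output format of "ip -o -4 addr show".

```python
import subprocess


def parse_ip_addr(output):
    """Map each IPv4 address in `ip -o -4 addr show` output to its interface."""
    addresses = {}
    for line in output.splitlines():
        fields = line.split()
        # Expected layout: "<index>: <ifname> inet <addr>/<prefix> ..."
        if len(fields) >= 4 and fields[2] == "inet":
            addresses[fields[3].split("/")[0]] = fields[1]
    return addresses


def local_ipv4_addresses():
    """Query the running system; requires the iproute2 `ip` tool."""
    result = subprocess.run(
        ["ip", "-o", "-4", "addr", "show"],
        capture_output=True, text=True, check=True,
    )
    return parse_ip_addr(result.stdout)


def missing_destinations(destinations, addresses):
    """Return the destination IPs that no local interface owns."""
    return [ip for ip in destinations if ip not in addresses]


# Hypothetical sample output, for illustration only.
SAMPLE = (
    "1: lo    inet 127.0.0.1/8 scope host lo\\       valid_lft forever\n"
    "5: eth5    inet 10.168.1.1/16 brd 10.168.255.255 scope global eth5\n"
    "9: eth5.2001    inet 10.200.1.1/16 brd 10.200.255.255 scope global eth5.2001\n"
)

if __name__ == "__main__":
    known = parse_ip_addr(SAMPLE)
    # Destination IPs as they might be read off the tcpdump output.
    print(missing_destinations(["10.200.1.1", "10.201.1.1"], known))
```

Any address reported as missing is a destination for which packets would arrive on the interface but be dropped, which points at a VLAN that is not configured on that node.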
@@ -55,6 +61,20 @@ Fractional or total input loss occurs when:
- "payload error" means the packet is marked as incomplete by the station.
- "otherwise bad" means the packet header is corrupted.
* The impact of payload errors is significant. They arrive scattered over time, and any flagged input is smeared over hundreds of samples
during processing due to the FIR filter. For a 64-channel interferometry observation, we measured the following:
  % payload errors    % visibilities flagged
  ------------------------------------------
       3.5%                 91%
       1.9%                 73%
       1.5%                 63%
       1.06%                44%
       0.22%                14%
       0.19%                12%
       0.10%                 6.7%
       0.002%                0.13%
* COBALT is not running at real time, and is thus unable to keep up with the input data. This triggers many errors, but all cases devolve into printing:
>>> ERROR RTCP.Cobalt.GPUProc - [block 1] Not running at real time! Deadline was 1.23456 seconds ago
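The payload-error measurements listed above can be compared against a simple independence model. The sketch below is an illustration only, not how COBALT computes flags: it assumes that each visibility is flagged whenever any one of n packets contributing to it has a payload error, and that errors strike packets independently with probability p, so that the flagged fraction is 1 - (1 - p)^n. The window n is a fit parameter introduced here, not a COBALT setting.

```python
import math

# (payload error %, visibilities flagged %) as listed in the table above.
MEASURED = [
    (3.5, 91.0), (1.9, 73.0), (1.5, 63.0), (1.06, 44.0),
    (0.22, 14.0), (0.19, 12.0), (0.10, 6.7), (0.002, 0.13),
]


def implied_window(error_pct, flagged_pct):
    """Solve flagged = 1 - (1 - p)**n for n (ASSUMED independence model)."""
    p = error_pct / 100.0
    f = flagged_pct / 100.0
    return math.log(1.0 - f) / math.log(1.0 - p)


def predicted_flagged_pct(error_pct, window):
    """Flagged percentage predicted by the same model for a given window."""
    p = error_pct / 100.0
    return 100.0 * (1.0 - (1.0 - p) ** window)


if __name__ == "__main__":
    for error_pct, flagged_pct in MEASURED:
        n = implied_window(error_pct, flagged_pct)
        print(f"{error_pct:7.3f}% errors -> implied window of {n:5.1f} packets")
```

Under these assumptions every row of the table implies a window of between about 50 and 70 packets, which illustrates why even a fraction of a percent of payload errors flags a large share of the visibilities.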