• 0 Posts
  • 46 Comments
Joined 11 months ago
cake
Cake day: June 22nd, 2024

help-circle

  • Gyroplast@pawb.socialtoich_iel@feddit.orgich📺iel
    link
    fedilink
    Deutsch
    arrow-up
    1
    arrow-down
    3
    ·
    7 days ago

    Genau! GENAU!! Das ist genau was ich meinte, was passieren kann. Du hast aus meiner Sicht eben nicht eindeutig zugestimmt, und das öffnet den Kommentar der Interpretation.

    Wenn man dann noch so viel schwätzt wie ich, wird auch das entscheidende “fälschlich” und “tatsächliche” überlesen oder anders interpretiert als gedacht, was deinen Kommentar von ebendiesen Arschlöchern eigentlich klar abgrenzen sollte. Tut mir Leid, ich wollte keinesfalls unterstellen, dass Du eine manipulative Technik angewandt hättest, bewusst oder unbewusst, sondern zeigen, wie leicht es ist, nur auf Basis eines Fetzen Textes als Leser zu einer solchen Unterstellung oder sonstigen, unbeabsichigten Interpretation zu gelangen.

    Es ist leider so, dass verschiedene Intentionen zu identischem Text führen können. Der Rückschluss, ich würde dir die ungünstigste Intention unterstellen, ist seinerseits so unzulässig wie unzutreffend, und nochmal ein schönes Beispiel, warum eindeutige Kommunikation immer ein ungelöstes Problem bleiben wird.

    Machen mer’s einfach: Du hast nichts falsch gemacht, ich will dir nix Böses, ich hab viel zu viel off-topic geschrieben und halte jetzt mein Maul, in der Hoffnung dir nicht den Start in den Tag madig gemacht zu haben, weil ein Spinner im Zwischennetz dich von der Seite angekackt hat. Das wollte ich nicht.

    Gottesgeschwindigkeit, zarte Männer!


  • Gyroplast@pawb.socialtoich_iel@feddit.orgich📺iel
    link
    fedilink
    arrow-up
    1
    arrow-down
    4
    ·
    7 days ago

    Du hast auf den Hinweis “ey, denk’ mal drüber nach, das Idiom auszuschleichen, weil hart negativ konnotiert wegen Nazis” reagiert mit einem Beleg für “ist nicht hart negativ konnotiert wegen Nazis”. Das “zum Teil Recht haben” kann man dann leider so oder so interpretieren. Wohlwollend als “Du hast Recht mit: ist ein Kackbegriff, den man besser nicht nutzen sollte”, weniger wohlwollend und leider häufiger als “Du hast Unrecht mit: bezieht sich auf die Nazizeit”, und die andere Seite des Recht habens wird vergessen, oder gar bewusst als argumentative Technik unterschlagen => Sofort-Flammenkrieg.

    ZL;NG: Deinen kurzen Kommentar kann man fälschlich als ausweichend oder ablenkend interpretieren, was eine beliebte Technik von tatsächlichen Arschlöchern ist, um zu provozieren. Vorschlagsanhängsel: “Der Spruch ist trotzdem kagge, danke für den Hinweis.”

    Kommunikation nervt. HDGDL.


  • Gyroplast@pawb.socialtoich_iel@feddit.orgich📺iel
    link
    fedilink
    Deutsch
    arrow-up
    10
    arrow-down
    2
    ·
    7 days ago

    Scheint mir ziemlich degeneriert und entartet, semantisch spitzfindig zu werden, wenn man auf einen nachvollziehbaren Effekt auf die Volksgesundheit in höflichster Form hingewiesen wird. Aber wir brauchen hier keine Endlösung für die Reinhaltung deutscher Sprache diskutieren, oder gar schädliche Verwendungen in eine positive Konnotation umkehren; das wäre voll schwul.

    Entgeltendmachung: kein Angriff auf dich, RFr. Fand’s nur angemessen, satirisch überzogen darzustellen, warum es keinerlei Rolle spielt, ob jemand “Recht hat” mit einer Wortherkunft, wenn der soziale Effekt unstrittig ist, und es sich um einen in keiner Weise fordernden oder vorwurfsvollen Hinweis handelt, den man beachten kann, wenn man nicht versehentlich(!) wie ein Stück Scheiße in illustrer Runde wirken möchte, obwohl man das gar nicht ist.

    Hochwähli ist raus, weitermachen!





  • Gyroplast@pawb.socialtoich_iel@feddit.orgich📏iel
    link
    fedilink
    Deutsch
    arrow-up
    3
    ·
    10 days ago

    Die Angabe hat die Schnüffelprüfung schon nicht überstanden, auswendig wusste ich das auch nicht mehr. 5 Joule sind echt wenig, und ein Liter Wasser schon echt viel. 5 Joule erzeuge ich Dir geschätzt mit einem freundlichen Handschlag auf die Wasseroberfläche, davon wird das nicht 1 °K wärmer.

    Aus der Schulzeit hängt mir noch heute was im Kopf: 1 Joule entspricht etwa der Kraft, die man auf der Erde aufwendet, um eine Tafel Schokolade (≙ 100 g) einen Meter gerade nach oben zu heben.







  • You could increase verbosity, and try working up your way from booting a bare minimum, to see when the system hangs, and if it persistently hangs at the same time, in the same way.

    My usual go-to is to add debug apic=debug init=/bin/sh vga=0 nomodeset acpi=off to kernel boot arguments and see if I consistently drop into the bare initramfs shell that way, without switching to any framebuffer graphics mode, while also avoiding potential ACPI breakage that may manifest as early boot freezes. Yes, vga=0 is legacy BIOS only, feel free to skip that one if you’re booting UEFI. This is not likely to avoid your problem, anyway.

    If that works, remove the arguments, from the right, one after another, to re-enable ACPI, then KMS, then automatic framebuffer console setting. If you’re still going, change init=/bin/sh to emergency, then to rescue, then remove to boot normally, always with excessive debug output. At that point, boot should freeze again, as you’ve only increased verbosity. The messages leading up to the freeze should always give a hint as to what subsystem might be worth looking into further - be it a specific module that freezes, which can subsequently be blacklisted by kernel parameter, for example. Let the system tell you its woes before stabbing at its parts randomly.

    This does not assume you having a software fault. This procedure uses the kernel init and following boot process as diagnostics, in a way. Unfortunately, it is pretty easy to miss output that is “out of the ordinary” if you’re not used to how a correct boot is supposed to look like, but the info you need is typically there. I typically try this before unplugging all optional hardware, but both approaches go hand in hand, really. I’ve found in modern, highly integrated systems, there’s just not that much available to unplug anymore that would make a difference at boot time, but the idea is still sound.

    If this becomes involved, you might want to look into using netconsole to send the kernel messages somewhere else to grab with netcat, and store them in a plain text file to post here for further assistance. You might just get a good hint when reading the debug kernel messages yourself already, though!

    EDIT: If those two colorful, pixely dotted lines in the lower half of your literal screen shot happen to flicker into view during boot somewhat consistently right before freezing, my gut feeling says it’s likely a graphics-related issue. You might want to short-circuit your tests by trying only debug nomodeset, a more brutal debug nomodeset module_blacklist=amdgpu,radeon, or replacing your GPU with a known good model, as suggested.


  • Do I need to run the machine for longer or should it have crashed right away according to your hypothesis?

    Sorry for mudding the waters with my verbosity. It should not crash anymore. I believe your kernel panic was caused when an idle CPU 6 was sent to sleep. Disabling C-states, or limiting them to C0 or C1, prevents your CPUs from going into (deep) sleep. Thusly, by disabling or limiting c-states, a kernel panic should not happen anymore.

    I haven’t found a way to explicitly put a core into a specific c-state of your choosing, so best I can recommend now is to keep your c-states disabled or limited to C1, and just normally use your computer. If this kernel panic shows up again, and you’re sure your c-state setting was effective, then I would consider my c-state hypothesis as falsified.

    If, however, your system runs normally for a few days, or “long enough for you to feel good about it” with disabled c-states, that would be a strong indication for having some kind of issue when entering deeper sleep modes. You may then try increasing the c-state limit again until your system becomes unstable. Then you know at least a workaround at the cost of some loss of power savings, and you can try to find specific issues with your CPU or mainboard concerning the faulty sleep mode on Linux.

    Best of luck!




  • screen froze, and I was forced to reboot the PC by pressing the power button for 3s

    seems like some data was saved, while other files were discarded

    I would not worry too much about a somehow “forgetful” file system immediately after a hard power cycle. This is exactly what happens if data could not be flushed to disk. Thanks to journaling, your FS does not get corrupted, but data lingering in caches is still lost and discarded on fsck, to retain a consistent fs. I would recommend to repeat the installations you did before the crash, and maybe shove a manual sync behind it, to make sure you don’t encounter totally weird “bugs” with man later, when you don’t remember this as a cause anymore. Your bash history is saved to file on clean shell exit only, and is generally a bit non-intuitive, especially with multiple interactive shells in parallel, so I would personally disregard the old .bash_history file as “not a fault, only confusing” and let that rest, too.

    Starting a long SMART self-test and a keen eye on the drive’s error logs (smartctl -l error <drive>), or better yet, all available SMART info (-x), to see if anything seems fishy with your drive is a good idea, anyway. Keep in mind that your mainboard / drive controller or its connection may just as well be (intermittently) faulty. In ye olden times, a defective disk cable or socket was messing up my system once or twice. You will see particular faults in your syslog, though - this is not invisible. You don’t only get a kernel panic without some sprinkling of I/O errors as well. If your drive is SMART-OK, but you clearly get disk I/O errors, time to inspect and clean the SSD socket and contacts and re-seat once more. If you never saw any disk I/O errors, and your disk’s logs are clean, I’d consider the SSD as not an issue.

    If you encouter random kernel panics, random as in “in different and unrelated call stacks that do not make sense in any other way”, I agree that RAM is a likely culprit, or an electrical fault somewhere on the mainboard. It’s rare, but it happens. If you can, replace (only) the mainboard, or better yet, take a working PC with compatible parts, and replace the working MBO with your suspected broken one to see if the previously working machine now faults. “Carrying the fault with you” is easier/quicker than proving an intermittent fault gone.

    Unless you get different kernel panics, my money’s still on your c-states handling. I’d prefer the lowest level you can find to inhibit your CPUs from going to sleep, i. e. BIOS > kernel boot args > sysctl > cpupower, to keep the stack thin. If that is finnicky somehow, you could alternatively boot with a single CPU and leave the rest disabled (bootarg nosmp). The point is just to find out where to focus your attention, not to keep this as a long-term workaround.

    To keep N CPUs running, I usually just background N infinite loops in bash:

    $ cpus=4; for i in $(seq 1 $cpus); do { while true; do true; done; } & done 
    [1] 7185
    [2] 7186
    [3] 7187
    [4] 7188
    

    In your case you might change that to:

    cpus=4; for i in $(seq 0 $((cpus - 1))); do { taskset -c $i bash -c 'while true; do sleep 1; done'; } & done
    

    To just kick each CPU every second, it does not have to be stressed. The taskset will bind each loop to one CPU, to prevent the system from cleverly distributing the tiny load. This could also become a terrible, terrible workaround to keep running if all else fails. :)


  • Gyroplast@pawb.socialtoArch Linux@lemmy.mlUnexplained reboots and kernel panic
    link
    fedilink
    English
    arrow-up
    23
    arrow-down
    1
    ·
    edit-2
    12 days ago

    Looking at the call trace:

    [ 1641.073507] RIP: 0010:rb_erase+0x199/0x3b0
    ...
    [ 1641.073601] Call Trace:
    [ 1641.073608]  <TASK>
    [ 1641.073615]  timerqueue_del+0x2e/0x50
    [ 1641.073632]  tmigr_update_events+0x1b5/0x340
    [ 1641.073650]  tmigr_inactive_up+0x84/0x120
    [ 1641.073663]  tmigr_cpu_deactivate+0xc2/0x190
    [ 1641.073680]  __get_next_timer_interrupt+0x1c2/0x2e0
    [ 1641.073698]  tick_nohz_stop_tick+0x5f/0x230
    [ 1641.073714]  tick_nohz_idle_stop_tick+0x70/0xd0
    [ 1641.073728]  do_idle+0x19f/0x210
    [ 1641.073745]  cpu_startup_entry+0x29/0x30
    [ 1641.073757]  start_secondary+0x11e/0x140
    [ 1641.073768]  common_startup_64+0x13e/0x141
    [ 1641.073794]  </TASK>
    

    What’s happening here leading up to the panic is start_secondary followed by cpu_startup_entry, eventually ending up in CPU idle time management (tmigr), giving a context of “waking up/sleeping an idle CPU”. I’ve had a few systems in my life where somewhat aggressive power-saving settings in the BIOS were not cleanly communicated to Linux, so to say, causing such issues.

    This area is notorious for being subtly borked, but you can test this hypothesis easily by either disabling a setting akin to “Global C States” in your BIOS, which effectively disables power-saving for your CPUs, or try an equivalent setting of the kernel arguments processor.max_cstate=1 intel_idle.max_cstate=0, or even a cpuidle.off=1.

    This is obviously losing your power-saving capability of the CPUs, but if your system runs stable that way, you’re likely in the right ballpark and find a specific solution for that issue, possibly in a BIOS/Fimware update. Here’s a not too shabby gist roughly explaining what c-states are. Don’t read too many of the comments, they’re more confusing than enlightening.

    The kernel docs I linked to above are comprehensive, and utterly indecipherable for a layperson. Instead of fumbling about in sysfs, try the cpupower tool/package to visualize the CPU idle settings, and try increasing enabled idle states until your system crashes again, to find out if a specific (deep) sleep state triggers your issue, and disable exactly that if you cannot find a bugfix/BIOS update.

    If this is your problem, to reproduce the panic, try leaving your system as idle as possible after bootup. If a panic happens regularly that way, try starting processes exercising all your CPUs - if the hypothesis holds, this should not panic at any time, as no CPU is ever idle.


  • In principle, yes, although the optional base-devel group depends on sudo, as do hundreds of AUR packages, and yet other projects you install tend to just assume you have a sudo binary around. Removing sudo will not break your system, but lead to well-deserved anger towards people not declaring their dependencies. :)

    If you’re ready to deal with the occasional application or script croaking, and subsequently fixing it, you should be fine removing sudo. I would personally consider building a dummy package replacing sudo, including a simple run0 wrapper at /usr/bin/sudo, to have a clean and transparent replacement.