I have also been done in many times by git-filter-repo. My condolences to the chef.
- 0 Posts
- 130 Comments
Outdated image, everything goes through palantir now
WalnutLum@lemmy.mlto Linux@lemmy.ml•ChatGPT's o3 Model Found Remote Zeroday in Linux Kernel Code3·17 days agoThere’s a lot of assumptions about the reliability of the LLMs to get better over time laced into that…
But so far they have gotten steadily better, so I suppose there’s enough fuel for optimists to extrapolate that out into a positive outlook.
I’m very pessimistic about these technologies and I feel like we’re at the top of the sigma curve for “improvements,” so I don’t see LLM tools getting substantially better than this at analyzing code.
If that’s the case I don’t feel like having hundreds and hundreds of false security reports creates the mental arena that allows for researchers to actually spot the non-false report among all the slop.
WalnutLum@lemmy.mlto Linux@lemmy.ml•ChatGPT's o3 Model Found Remote Zeroday in Linux Kernel Code7·17 days agoIt found it 8/100 times when the researcher gave it only the code paths he already knew contained the exploit. Essentially the garden path.
The test with the actual full suite of commands passed in the context only found it 1/100 times and we didn’t get any info on the number of false positives they had to wade through to find it.
This is also assuming you can automatically and reliably filter out false negatives.
He even says the ratio is too high in the blog post:
That is quite cool as it means that had I used o3 to find and fix the original vulnerability I would have, in theory, done a better job than without it. I say ‘in theory’ because right now the false positive to true positive ratio is probably too high to definitely say I would have gone through each report from o3 with the diligence required to spot its solution.
WalnutLum@lemmy.mlto Linux@lemmy.ml•ChatGPT's o3 Model Found Remote Zeroday in Linux Kernel Code5·17 days agoI’m not sure if the Gutenberg Press had only produced one readable copy for every 100 printed it would have been the literary revolution that it was.
WalnutLum@lemmy.mlto Selfhosted@lemmy.world•Good experience with neko remote browserEnglish2·18 days agoI’m not sure if it would work for your situation but you seem to be able to ssh into a server on that network? If so you can run a browser on that computer and tunnel the X session over ssh:
https://www.cyberciti.biz/tips/running-x-window-graphical-application-over-ssh-session.html
Otherwise neko seems neat, I’ve actually been looking for something for watch parties.
WalnutLum@lemmy.mlto Linux@lemmy.ml•ChatGPT's o3 Model Found Remote Zeroday in Linux Kernel Code60·18 days agoThe Blog Post from the researcher is a more interesting read.
Important points here about benchmarking:
o3 finds the kerberos authentication vulnerability in the benchmark in 8 of the 100 runs. In another 66 of the runs o3 concludes there is no bug present in the code (false negatives), and the remaining 28 reports are false positives. For comparison, Claude Sonnet 3.7 finds it 3 out of 100 runs and Claude Sonnet 3.5 does not find it in 100 runs.
o3 finds the kerberos authentication vulnerability in 1 out of 100 runs with this larger number of input tokens, so a clear drop in performance, but it does still find it. More interestingly however, in the output from the other runs I found a report for a similar, but novel, vulnerability that I did not previously know about. This vulnerability is also due to a free of sess->user, but this time in the session logoff handler.
I’m not sure if a signal to noise ratio of 1:100 is uh… Great…
WalnutLum@lemmy.mlto Linux@lemmy.ml•ChatGPT's o3 Model Found Remote Zeroday in Linux Kernel Code8·18 days agoThis would feel a lot less gross if this had been with an open model like deepseek-r1.
I’m not sure how you’re getting wallpaper engine to work on Linux because it’s not supported on anything other than windows.
Are you using Wallpaper Engine? If so you are likely going to keep having issues with your screen blanking while you try and use it, as it’s not supported on Linux.
WalnutLum@lemmy.mlto World News@lemmy.ml•Bluff: All Imported Honey from China Fake, New Tests Show24·20 days agoThe article you’re commenting on is about EU grocery store honey being fake
WalnutLum@lemmy.mlto Free and Open Source Software@beehaw.org•You Can Choose Tools That Make You Happy2·22 days ago“The future” is whatever the majority of young people decide it will be, regardless of it’s the past or not.
Jokes aside I actually do appreciate that almost all guix packages are verified source and not just copy scripts of already built tarballs.
Guix is awesome!
Nonguix substitute server is down for the fifth straight day, forcing me to rebuild the entire Linux kernel when updating
And you should Never use it!
WalnutLum@lemmy.mlto Technology@beehaw.org•Google's new AI video tool floods internet with real-looking clips10·25 days agoI feel like it’s more likely more and more young people will just retreat from the open internet and only trust content from people they already know don’t post AI clips.
I agree to some degree but the gnu project doesn’t have a great track record for performative hosting (savannah is very prone to going down for long periods of time.)
I don’t begrudge better hosting infrastructure from a different non-profit.
As a guix user and package maintainer I’m ecstatic.
I’m so proud of the community for rallying around the needs and pain points of everyone and making this decision. This reduces so many pain points for a guix user and will hopefully smooth out the package maintenance process a great deal. Email is simple but trying to do code change communication over it can be very complex and time-laborous.
If you’re curious about functional packaging systems grab guix on your distro and give it a try!
Special shout out to anyone burnt out on Nix lang. Come feel the warm embrace of Scheme’s parentheses. :)
What other digital payment system than crypto allows cold wallets?
WalnutLum@lemmy.mlto Linux@lemmy.ml•What helps people get comfortable on the command line?5·1 month agoOn that front: to developers-
Please make sure you include bash completions for your tools
I’m no expert, but usually when missiles “go ballistic” their engines turn off and they have limited maneuvering capability at the end of their flight.
This one looks like it had engines on all the way to the target, which is a fairly newer class of design.