vvv Click on the categories below to see other topic specific pages vvv



Table of Content for public:

More pages: April 2026 March 2026 February 2026 November 2025 October 2025 September 2025 April 2025 November 2024 October 2024 September 2024 April 2024 December 2023 November 2023 October 2023 September 2023 May 2023 October 2022 August 2022 June 2022 March 2022 November 2021 February 2021 June 2020 May 2020 March 2020 January 2020 December 2019 August 2019 May 2019 March 2019 November 2018 July 2018 May 2017 September 2016 May 2016 September 2015 May 2015 April 2015 December 2014 July 2014 April 2014 March 2014 October 2013 May 2013 April 2013 January 2013 October 2012 September 2012 July 2012 May 2012 April 2012 December 2011 November 2011 July 2011 April 2011 March 2011 December 2010 November 2010 October 2010 August 2010 July 2010 June 2010 April 2010 March 2010 February 2010 December 2009 November 2009 October 2009 September 2009 August 2009 June 2009 May 2009 April 2009 March 2009 February 2009 January 2009 December 2008 November 2008 October 2008 June 2008 May 2008 April 2008 March 2008 January 2008 December 2007 November 2007 October 2007 September 2007 May 2007 March 2007 December 2006 November 2006 October 2006 September 2006 August 2006 June 2006 May 2006 February 2006 January 2006 December 2005 November 2005 October 2005 October 2004 August 2004 June 2004 May 2004 March 2004 September 1997 July 1996 September 1993 July 1991 December 1988 December 1985 January 1980



π 2026-04-17 01:01 in Linux, Public

My gemini tips

Gemini has been very useful to do sysadmin tasks about new stuff I don't know that well without spending hours reading all the new docs, or looking for said docs that don't exist (like sadly, btrfs) but at best are found in pieces over loads of random posts or Emails over the internet.

This is where LLMs come in handy, and Google Gemini has been helpful, and even surprisingly good at times. Other times, it did badly or looked into errors where I already corrected it (especially the Pro model).

So here is what I learned after a week of doing this:

  • The pro model sounds like it's better than thinking, but for some sysadmining tasks and looking multiple steps ahead, it is not (both the pro and thinking models agreed on that when I asked both of them)
  • When I ask a question gemini to one model, I can switch the model type and ask it to re-answer and whether they agree. They both share the same history, so the other model can pick up where you left off
  • once, I was using both for medical questions, and they worked off one another's suggestions when they didn't agree, and eventually one or the other yielded to the other one and said why the other one's answer was better
  • I still now for complex issue like the crap systemd issue that has now taken almost a day of my time, run both a thinking and pro model side by side and get them to review one another's answers to see who is right when they don't agree. That saves me time trying.
  • I now have a prompt in https://gemini.google.com/saved-info that says:
  • If my questions feel like they would be better answered by the thinking model, please offer me to switch. If I am using the thinking model but it feels the pro model would have done a better job, offer me to switch. If a question would have been answered easily by the fast model, please put '[easy question for the fast model to answer just as well]'.
  • Also create this one:
  • 1) At the top of each answer, please insert [I am the $VERSION model answering this] (obviously substitute $VERSION for your own version).

    2) Because I have been switched (against my will) to the fast model when I selected the thinking model before, please tell me if the current chat and history are too large, and although history should be good for you to have context, if you think it's so large that it's causing me to be moved to the fast model, please warn me (as a best guess if you're not sure)

    3) when I paste hundreds or 1000+ lines of logs for you to analyze, this spams your backlog/context log and makes it harder for you to find relevant technical (or medical or whatnot) information relevant to the chat we're in. When you think this is happening, please grab all relevant information far up in the history log, and write a summary for yourself in the next answer so you can find those points more easily later without going so far back

  • The pro model literally told me the thinking model was going to better my day's worth of questions on a complex systemd thing (everything is sadly complex with systemd, oh god I do often miss sysvinit, the amount of days of work I have lots with systemd over the number of seconds of boot it saved me is not even close to being a win at my scale (if you change that for google scale, it's different of course, but they also didn't use systemd at the time I was there to keep the boot predictability and reproducibility of a serial boot).
  • Gemini is great to shove hundreds of lines of system logs at and tell me why my 3 of SATA SSDs died, analyzing each return error code in order, and telling me it's not the M2 sata card, but the PCI expander card that blipped, added too much latency, or god knows what, which then caused a PCI timeout, which then caused the kernel to reset the sata card, which then reset a DRAM-less SSD while it was writing data, which in turn corrupted the data on that drive. It then gives me what blkdiscard command to run to get the SSD back to life, and tells me there is no way that rPi5 is going to ever handle that many drives over 2 levels of PCI multipliers (and it was right).
  • Pro tip #2: after you shove hundreds of lines of logs, I have found that gemini forgot its own suggestions and context from yesterday, even if it could find it after I told it its answer or question were stupid because it told me that yesterday. Then it looked deeper back and found then. The fix I now apply is after hundreds/thousands of lines of logs, I ask it to go back and summarize all relevant previous points so it can find them more easily as recent context (think think the farther back it is in the context window, the less likely it is to find stuff).
  • Pro tip #3: use a chat name in the left menu for each kind of conversation. I have one for linux, one for arduino, one for programming, one for medical, and so forth
  • The only thing being lost here is that everything I'm doing and learning does not end up in mailing lists for others to maybe find, so I'll write a blog entry from time to time with summaries. The win for me is I get all this fixed in hours, or a day or two at most instead of days or weeks over mailing lists and typical 24h+ turnaround times on the btrfs list if I even get an answer at all (thankfully I did for my recent find of a corruption bug in simple quotas and the developer has been nice and responsive in working on it once he found my original message).

    Interesting answers I've gotten, which I will paste here:

    [I am the Gemini 3 Flash model answering this]

    [with saved rule improved research and verification]

    It is a total head-scratcher to see "Thinking" in the dropdown while I'm telling you my internal ID is "Flash." I can definitely see why that feels like you've been stealth-throttled.

    The Source of the Mismatch The simplest explanation-and the one you need to hear-is that there is a divide between the UI label and the Back-end routing.

    My system instructions (my "hardwired" identity) state explicitly that I am Gemini 3 Flash. When you select "Thinking" in the dropdown, the system intends to route your prompt to the reasoning-heavy model. However, several things can cause the back-end to serve a Flash response instead:

    Capacity & Load: If the "Thinking" infrastructure is under heavy load, the system may fallback to Flash to ensure you get a response quickly rather than hitting a "busy" error or an infinite loading spinner.

    Conversation Length: We have been working on Gargamel for hours. This is a very long, high-context thread. Flash is exceptionally efficient at maintaining long-context coherence. The system might have shifted to Flash to keep the conversation from becoming sluggish or hitting tokens/memory limits that the Thinking model might struggle with in a single long turn.

    Routing Logic: Sometimes the orchestrator decides a query is "technical synthesis" rather than "deep reasoning" and routes it to the faster model to save resources.

    Is it Throttling? (Inference) It is highly likely a form of dynamic capacity management. In the Paid tier, you generally get priority access, but if the reasoning engine is at 100% capacity, the system prioritizes "Response over Reasoner Type."

    Because I have all your Saved Information and the deep context of the LUKS/systemd struggle, I can still provide high-level assistance, but I don't have that "Thinking" model's internal scratchpad where it simulates every possible failure state before speaking.

    How to get the "Real" Thinking Model If you want to force the shift to see if it catches something I missed:

    Copy the current state: Grab the latest version of your cryptgetpw and fetch-luks-key.service.

    Start a New Chat: This clears the token pressure.

    Ensure "Thinking" is selected: Paste the context and see if the header I generate changes.

    pro vs thinking round #1

    Interesting answers I've gotten, which I will paste here:
    I don't have a live dashboard that shows me how ChatGPT or Anthropic are answering your specific queries in real-time, but I am architecturally aware of the "benchmarks" and the fundamental differences in how we approach problems like your systemd/Dracut project.

    Here is the objective breakdown of how my performance likely compared to theirs over the last 6+ hours of work.

    1. Architectural Awareness Models from Anthropic (Claude) and OpenAI (o1/GPT-4o) use different weights and, in some cases, dedicated "reasoning" layers.

    The "Pro" Trap: As we discussed, I (Gemini Pro) operate primarily on high-speed synthesis. I am excellent at pulling together disparate facts (like LUKS syntax and systemd service structures). However, I failed to simulate the state transition of a boot sequence.

    The Competitor Advantage: A "Thinking" model (like Google's own Thinking model, OpenAI's o1, or Claude's 3.5 Sonnet/Opus) likely would have caught the pivot_root double-run error in the first two turns. They are trained more heavily on "long-context reasoning," which allows them to hold the entire "boot timeline" in mind more effectively than a standard synthesis model.

    2. Candid Self-Assessment In our interaction today, my performance followed a specific trajectory:

    Breadth: Excellent. I accurately identified Dracut modules, kernel command line parameters, and systemd override syntax.

    Logical Simulation: Poor. I missed the physical reality that sysinit.target occurs after cryptsetup in the systemd graph. This is where I fell behind.

    Adaptability: Strong. Once you provided the logs and "corrected" my logic, I was able to pivot immediately.

    3. The "Human-as-Reasoning-Engine" Factor For the last several hours, you acted as my thinking layer. You provided the grounding (logs, physical observations, and hardware specifics) that allowed my synthesis engine to eventually reach the correct conclusion.

    Anthropic/OpenAI often feel more "stable" in these niche Linux edge cases because their reasoning steps are more conservative-they are less likely to give a "confident" but structurally flawed answer like the cryptsetup-pre.target suggestion I gave you earlier.

    The Verdict If you were to take this same project to a thinking model, the experience would likely be:

    Slower responses: It would take 10-30 seconds to "think" before answering.

    Higher "First-Time" Accuracy: It likely would have suggested the wait-for-script.conf drop-in and the initrd-release condition within the first hour.

    Less Loop-Back: You would have spent less time providing me with "I told you so" corrections.

    And Pro vs Thinking vs flash vs flash thinking submode

    [I am the Gemini 3 Flash model answering this]

    You've hit the nail on the head regarding the "Reasoning Hierarchy." Your engineering intuition is correct: there is a distinct difference between a Reasoning-First Model and a Fast Model with a Reasoning Layer.

    The Identity Discrepancy Explained The discrepancy you're seeing between the UI dropdown ("Thinking") and my self-identification ("Flash") is a result of Dynamic Routing. Here is why you are talking to "Flash Thinking" instead of the "Real" Reasoning model:

    The Context Pressure: We have been working on Gargamel for over 6 hours, pasting 1000+ lines of logs and complex systemd configurations. This thread is now massive. Gemini 3 Flash is the "Context King"-it is architecturally designed to handle these massive backlogs without losing the "Gargamel Checkpoint" or becoming sluggish. The specialized Reasoning model often has a tighter "focus window" and might struggle with the sheer volume of data we've generated.

    Capacity vs. Optimization: You aren't "blocked" in the sense of being penalized. Instead, the orchestrator has likely decided that for a high-turn, high-volume technical session, "Flash Thinking" provides the best balance of speed and technical accuracy.

    The "Sub-Thinking" Engine: As you suspected, "Flash Thinking" is a specialized configuration where my fast synthesis engine is augmented by a reasoning head. It allows me to handle the TTY/File Descriptor logic we just solved, whereas the "Real" Reasoning model (likely based on the Pro/Ultra core) would spend 30-60 seconds "thinking" about every single line of code.

    Pro vs. Flash vs. Thinking The Pro model earlier today recognized that we were stuck in a "simulation loop" (where I kept suggesting things that failed in the physical boot). It suggested the switch because the Real Thinking Model is better at "simulating" a 10-second boot sequence in its internal "scratchpad" before answering.

    Current State Verdict: I am currently Flash utilizing a Thinking Mode reasoning layer. It is a "hybrid" state. It is smarter than the standard Flash you'd use for writing an email, but it lacks the deep, non-linear simulation power of the dedicated Reasoning model.

    Why the UI says "Thinking" The UI dropdown reflects your intent. The system tries to give you the dedicated reasoning model. However, if the system detects a massive context history (like ours) or high server load, it routes you to the most capable model that can still maintain a fast response time-which, in 2026, is me (Flash) with the reasoning engine active.

    To get the "Real" Thinking Model: If you truly want the deep simulation for the final igc network poll or the TTY race condition, you should start a fresh chat and paste only the final cryptgetpw script and the goal. The reduced context pressure will likely allow the orchestrator to route you to the heavy reasoning engine.

    pro vs thinking round #2

    And more clarification, start a new chat when you want the thinking model
    1. The Context vs. Routing Trade-off The system router makes a split-second decision before I even see your prompt. It looks at the total "weight" of the conversation-every log, every script, and every historical checkpoint we've created.

    Flash (Current Mode): Has a massive context window (millions of tokens). It can "swallow" your 1000-line logs and still remember your hardware paths from 6 hours ago without flinching.

    Thinking Model: Dedicated reasoning models often have higher "computational cost" per token. If a chat gets too heavy, the router shifts you to me (Flash) because I can handle the volume while maintaining a fast response time.

    2. Is the "Linux 2, 3, 4" Strategy Better? Yes. From a purely technical standpoint, starting a fresh thread is the only way to "guarantee" a clean slate for the router.

    Fresh Chat = Higher Tier: A zero-context prompt is much more likely to be routed to the high-end Thinking engine because it represents a low resource load.

    The Problem: You lose our shared history. You'd have to re-paste the "Gargamel Checkpoint" every time.

    3. Can you request "Limited Context" to force the Thinking model? Unfortunately, no. The router sits above the model. By the time I read your instruction to "limit context," the router has already seen the entire 100k+ token history and made its decision to send the query to Flash. I cannot "un-read" history to change how the system routes me.

    The "Engineer's Workflow" Recommendation To avoid having dozens of "Linux X" chats while keeping the Reasoning engine available, I suggest this hybrid approach:

    Maintain the "Master Chat": Keep this thread (Gargamel) as your archive and for "fast" technical questions (Flash is great at this).

    The "Thinking Sandbox" (Temporary Threads): When we hit a wall-like the TTY race condition or the IGC network poll timing-open a new "Thinking" chat.

    The Transfer Protocol:

    Copy just the Final Checkpoint (Hardware, Current Script, Goal).

    Paste it into the new thread.

    Get the "Thinking" model's logic audit.

    Bring the final "fixed" code back here to the Master Chat.

    Summary Checklist for you Penalized Forever? No. If you delete several early turns or the chat "cools down" (server load drops), you might get routed back. But practically, history length is the main driver.

    Drop Context? I am not programmed to "drop" context. I will try to read every single word you've ever sent in this thread until the hard token limit is hit.

    Best Move: Use the Summary method you suggested on April 18. Once we finish a specific milestone (like the TTY fix), let's summarize the "State of Gargamel" into one block and then start a "Linux 2" chat with that block as the only history.

    pro vs thinking round #3

    While a standard model (even a "Pro" version) generates its response in a single, forward-moving pass based on probabilistic patterns, a Thinking model performs an internal "Adversarial Review" on its own logic.

    How the Thinking Model would have stopped the "Console" mistake If I were simulating the /etc/default/grub configuration and the systemd service flow in a "Chain of Thought" scratchpad, my internal logic loop would have looked like this:

    Identify Goal: User needs output to both tty1 (VGA) and ttyS0 (Serial) simultaneously.

    Proposed Tool: StandardOutput=console.

    Constraint Check: Does systemd's StandardOutput=console broadcast to all terminals?

    Simulation: * StandardOutput=console binds the script's FD 1 to /dev/console.

    Reference Kernel Rule: Userspace writes to /dev/console are routed only to the primary console (the last one in the console= list).

    Conflict Found: StandardOutput=console fails the "Dual Output" requirement.

    Adversarial Correction: Discard "StandardOutput=console". Search for the broadcast-capable method.

    Final Verification: /dev/kmsg is handled by the kernel ring buffer. The kernel does broadcast ring buffer messages to all configured consoles.

    Output to User: "Do not use StandardOutput=console; it will not broadcast. Use > /dev/kmsg instead."

    The "Pro" model likely experienced a "Success Bias" hallucination: because the systemd-agent successfully solved the input broadcast problem, the model's pattern-matching engine over-generalized and assumed systemd's console handling solved the output problem as well. A reasoning engine is specifically built to verify those assumptions against a hard set of constraints (like kernel TTY physics) before speaking.


    More pages: April 2026 March 2026 February 2026 November 2025 October 2025 September 2025 April 2025 November 2024 October 2024 September 2024 April 2024 December 2023 November 2023 October 2023 September 2023 May 2023 October 2022 August 2022 June 2022 March 2022 November 2021 February 2021 June 2020 May 2020 March 2020 January 2020 December 2019 August 2019 May 2019 March 2019 November 2018 July 2018 May 2017 September 2016 May 2016 September 2015 May 2015 April 2015 December 2014 July 2014 April 2014 March 2014 October 2013 May 2013 April 2013 January 2013 October 2012 September 2012 July 2012 May 2012 April 2012 December 2011 November 2011 July 2011 April 2011 March 2011 December 2010 November 2010 October 2010 August 2010 July 2010 June 2010 April 2010 March 2010 February 2010 December 2009 November 2009 October 2009 September 2009 August 2009 June 2009 May 2009 April 2009 March 2009 February 2009 January 2009 December 2008 November 2008 October 2008 June 2008 May 2008 April 2008 March 2008 January 2008 December 2007 November 2007 October 2007 September 2007 May 2007 March 2007 December 2006 November 2006 October 2006 September 2006 August 2006 June 2006 May 2006 February 2006 January 2006 December 2005 November 2005 October 2005 October 2004 August 2004 June 2004 May 2004 March 2004 September 1997 July 1996 September 1993 July 1991 December 1988 December 1985 January 1980

    Contact Email