don't let LLMs erode first principles thinking

2025-01-12

background

consider a trader who is backtesting a series of alphas that target the 30 DTE contracts on the S&P 500 Index options chain. to increase sample size, the trader uses a window of +/- one day from the target.

the trader considers the following:

in order to avoid adding complexity where unncessary, the trader starts question the premise:

use an llm!

through the selectively biased lens of my day-to-day experiences, it seems increasingly more common to default to machine-first thinking instead of human-first thinking. stated another way, there seems to be an upward trend of outsourcing logic and reasoning.

for clarity, machine-first thinking is not the same as machine-aided thinking. in my mental model, machine-aided thinking is a subset of human-first thinking:

of course, machine-first thinking is represented in a reductive manner above; to capture the notion that a human supervises the reasoning, let us expand the notation a bit:

the expression of C(1-ε)m+εh simply captures:

for example, one might say that someone who blindly accepts machine reasoning has ε=0:

furthermore, by definition, human-first thinking blindly rejects machine reasoning i.e. ε=1:

while i am admittedly a staunch contrarian when it comes to the topic of LLMs, i do concede they have realized utility in some applications. my intention is not to debate the utility curve of LLMs; rather, i aim to invite deeper thinking about effects of human-LLM interactions.

outsourcing logic and reasoning may be perfectly valid in some contexts, but not all contexts. what i worry about most is that continuous exposure to outsourcing logic and reasoning in valid contexts erodes the human detection mechanism responsible for drawing that line in the first place.

as of writing, machines cannot reason from first-principles; therefore, humans must become more resilient in explicitly protecting the spaces in which outsourcing logic and reasoning may lead to less than ideal outcomes. a serendipitous, cherry-picked personal example follows for those interested in virtually useless proofs.

virtually useless proof

if one were to take trading days T and weekends W as such:

T={0,1,2,3,4}

W={5,6}

if today tT and target DTE D, one might observe that deriving day of the week d{TW} for D is simply:

(t+D)mod7=d

for 30±1 DTE strategies, one is never presented two equivalent choices for days-to-expiry in non-holiday conditions.

using the above, one observes when the possibility of a choice may arise:

dWt{3,4}

that is to say, a choice may only happen on Thursday or Friday. given, two choices: D-1' and D+1' representing 29 DTE and 31 DTE, respectively:

(3+D-1')mod7=4

(3+D+1')mod7=6

(4+D-1')mod7=5

(4+D+1')mod7=0

thus, t=3 is always presented D-1' while t=4 is always presented D+1'.

visual proof

using 0 to indicate the current trading day, there exists a nice visual proof; make note of when t=3 (thursday) is the current day (0) and t=4 (friday) is the current day (0).

calendar-based visual proof of written modular arithmetic proof

closing thoughts

questioning assumptions and re-stating base truths often can combat misguided efforts, such as adding complexity where none is required. this is the essense of human-first thinking that must continue to be protected in the age of outsourcing logic and reasoning.

as always, it is helpful to take a step back every now and then and ask,

why?

appendix: llm responses

note: in no way are the results below meant to be cherrypicked, i do not pay for any LLM services. as such, the results below are derived using the "smartest" free model i could select.

additionally, there's usually more than one way to do the same thing; i am sure there exists a "machine-first power user" who could likely construct the proof above (or some similarly acceptable proxy of the proof) in a similar amount of time that i spent constructing it human-first... perhaps even shorter?

Prompt

SPXW options have 5 daily expirations (Monday-Friday) I want to trade an options strategy that buys 30 DTE contracts, but I want to include +/-1 day in my range (29-31 DTE) to ensure I get enough samples when 30 DTE isn't available.

Give me a list by weekday (Monday-Friday) of which expiries (29, 30, 31 DTE) are available.

ChatGPT Response

Monday

Tuesday

Wednesday

Thursday

Friday

Notes:

Gemini Response (2.0 Flash)

Here's a breakdown of the available SPXW expirations (29, 30, and 31 days to expiry) by weekday: