Conversation
Much easier to pick one to debug this way
We can safely use an unbounded `@cache`, because there can only be 16 valid pairs
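To illustrate the point about the unbounded cache: a minimal sketch (all names here are invented for illustration, not Narwhals API) of why `functools.cache` is safe when the argument domain is small and fixed — the cache can never grow beyond the number of distinct valid pairs.

```python
from functools import cache

# Hypothetical sketch: with a small, fixed domain of valid (left, right)
# dtype pairs, an unbounded functools.cache can never grow beyond that
# domain, so no LRU bound is needed.
VALID_PAIRS = {
    ("Int32", "Int64"): "Int64",
    ("Int64", "Int32"): "Int64",
    ("Float32", "Float64"): "Float64",
    ("Float64", "Float32"): "Float64",
}

@cache
def supertype(left: str, right: str) -> "str | None":
    # Cache entries are bounded by the number of distinct argument pairs.
    if left == right:
        return left
    return VALID_PAIRS.get((left, right))
```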
As much as is possible without #3396
Need to decide how many of the others to leave as todos. Main theme is needing `get_supertype` (#3396)
Everything left requires `get_supertype` (#3396)
* refactor: Replace `_same_supertype` with a custom `@singledispatch`. This is more generally useful and a LOT easier to read from the outside
* refactor: Just use a real class
* fix(typing): Satisfy `mypy`
* fix: Oops forgot the first element
* refactor(typing): Use slightly better names
* chore: Rename `default` -> `upper_bound`
* docs: Replace debugging doc
* docs: More cleanup
* refactor: Use `__slots__`, remove a field
* docs: More, more cleanup
* docs: lil bit of `.register` progress
* cov
* test: Get full coverage for `@just_dispatch`
* chore: Give it a simple repr
* test: Oops, forgot that was an override
* revert: Keep only what is required. See #3396 (comment)
* refactor: Simplify `@just_dispatch` signature
* fix(typing): Satisfy mypy
* test: Gotta get that coverage. Resolves #3410 (comment)
* docs: Restore a minimal version of `@just_dispatch` doc. Resolves #3410 (comment)
* revert: Remove `Impl` alias. #3410 (comment)
* refactor: Rename `Passthrough` -> `PassthroughFn`. Suggested in #3410 (review)
* docs: Add note to use only on internal. Suggested in #3410 (review)
MarcoGorelli
left a comment
Are we sure we should be doing this?
I don't think different libraries follow the same supertyping rules, and i'm not sure it's something we should impose
e.g. Datetime('us') vs Datetime('ns'): Polars goes to the former, pandas to the latter
```python
In [16]: df = pl.DataFrame({'a': [datetime(2020,1,1)]})

In [17]: pl.concat([df.with_columns(pl.col('a').cast(pl.Datetime('ns'))), df], how='vertical_relaxed')
Out[17]:
shape: (2, 1)
┌─────────────────────┐
│ a                   │
│ ---                 │
│ datetime[μs]        │
╞═════════════════════╡
│ 2020-01-01 00:00:00 │
│ 2020-01-01 00:00:00 │
└─────────────────────┘

In [18]: pd.concat([df.with_columns(pl.col('a').cast(pl.Datetime('ns'))).to_pandas(), df.to_pandas()], axis=0).dtypes
Out[18]:
a    datetime64[ns]
dtype: object
```
But doesn't that inconsistency show an example of how - if we don't address it - there's a knock-on effect to things like selectors? IMO, (#3396 (review)) is the kind of thing that won't be an issue to most use cases - but when it is, it could be a slog to debug. I wanna stress that my goal is a set of rules.
I like the rules we have here, but I'm still open to more fiddling 🙂
i think it could also be a slog to debug when someone switches from pandas (or some other library) to narwhals. for selectors, i think it's fairly common to select by kind (like all temporal columns, or all datetime ones) rather than some exact dtype (like
@MarcoGorelli I find the polars behavior a bit odd. I didn't check what other backends do, nor could I find a polars issue on the topic. What would you propose to do here? I guess one option is that we start by not allowing supertyping for datetime and duration dtypes unless they have the same time_unit. That's probably the safest approach to begin with, as a user can always decide how to do it externally if needed
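The proposed safe starting point could be sketched like this (a toy stand-in, not the PR's implementation; the `Datetime` class and `datetime_supertype` names are invented for illustration):

```python
from dataclasses import dataclass

@dataclass(frozen=True)
class Datetime:
    """Toy stand-in for a Datetime dtype; only the time unit matters here."""
    time_unit: str  # "ms", "us", or "ns"

def datetime_supertype(left: Datetime, right: Datetime) -> Datetime:
    # Sketch of the proposed rule: backends disagree on which resolution
    # wins (polars widened to "us" above, pandas to "ns"), so raise
    # instead of silently picking one; the caller can cast explicitly.
    if left.time_unit != right.time_unit:
        msg = f"no supertype for Datetime({left.time_unit!r}) and Datetime({right.time_unit!r})"
        raise TypeError(msg)
    return left
```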
Yup, happy for supertyping datetimes of different resolutions to raise. Is this mostly about int32 vs int64 -> int64 kind of operations? If so, I think those at least should be fairly standardised, ok with doing those. Is there any other kind of supertyping that this PR does? sorry i haven't clicked through everything
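The "int32 vs int64 -> int64" style of widening mentioned here is simple enough to sketch — the supertype of two signed integer dtypes is just the wider one (names below are illustrative, not Narwhals API):

```python
# Hypothetical sketch of integer widening: pick the wider of two signed
# integer dtypes, since widening never loses values.
_INT_WIDTH = {"Int8": 8, "Int16": 16, "Int32": 32, "Int64": 64}

def integer_supertype(left: str, right: str) -> str:
    # Either input can be safely cast up to the result.
    return left if _INT_WIDTH[left] >= _INT_WIDTH[right] else right
```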
Thanks @MarcoGorelli
Alright, I think that can be a starting point. @dangotbanned WDYT?
Yes I feel you, this is definitely a large one with a lot of commits. I guess the quickest way to get a grasp of it is the documentation page we have written, or even quicker a chart. The TL;DR is that:
thanks! 🙏 i've read through, and my initial reaction is that this is too complicated. why are we dealing with String vs Int64, for example? do we have a use-case for that?
To me the buggy part is that in the current implementation of concat every backend has a different behavior (#3191 (comment)). This PR is a prerequisite for consistent behavior:
Coming to:
Casting numeric to string is standardized across backends; I don't see why that would be problematic to support. If you are up for it, let's have a call/chat to better understand what we could land with this PR
Yep, the idea is we can allow more things if we can have consistent semantics by casting first. I made a list of others.
Where we can apply the rules
I think that in a lot of these cases, we wouldn't have coverage (understandably) for what each backend does on its own.
Top-level functions
are you suggesting that

```python
In [55]: df = duckdb.sql("""select * from values (1.5), (2.5) df(a)""")

In [56]: duckdb.sql("""
    ...: from df
    ...: select a + a
    ...: """)
Out[56]:
┌──────────────┐
│   (a + a)    │
│ decimal(3,1) │
├──────────────┤
│     3.0      │
│     5.0      │
└──────────────┘

In [57]: df.pl().select(pl.col('a')+pl.col('a'))
Out[57]:
shape: (2, 1)
┌───────────────┐
│ a             │
│ ---           │
│ decimal[38,1] │
╞═══════════════╡
│ 3.0           │
│ 5.0           │
╘═══════════════╛

In [58]: df.pl()
Out[58]:
shape: (2, 1)
┌──────────────┐
│ a            │
│ ---          │
│ decimal[2,1] │
╞══════════════╡
│ 1.5          │
│ 2.5          │
└──────────────┘
```

and i'm not sure we should be standardising it
All I'm suggesting is that the list in (#3396 (comment)) is where those rules could be applied. I know that
@MarcoGorelli you could pick any combination of types/backends and find examples of incompatibilities. I don't think it is helpful, considering where I started (#3396 (comment)):
I'm gonna try a different angle ... (Provided we can cast our way there) What do you think about using <insert-a-set-of-rules-here> in places where a backend would otherwise error? I think that you're okay with that, and these overlap a lot with (#3396 (comment)):
The main motivator for this PR is supporting another of these cases (#3398), but wanting to do it in a standardised way. I would like it if you could write this and it is reliable:

```python
nw.concat(items, how="vertical_relaxed")
```

I know that we can do this.
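Conceptually, a relaxed vertical concat boils down to: find a common supertype per column, cast both sides to it, then concatenate. A plain-Python sketch under that assumption (`SUPERTYPES` and `concat_relaxed` are illustrative names, not the PR's actual rules or API):

```python
# Sketch (plain Python, not real dataframes) of what a relaxed vertical
# concat implies: look up a common supertype, cast, then concatenate.
SUPERTYPES = {("int", "float"): float, ("float", "int"): float}

def concat_relaxed(left: list, right: list) -> list:
    lt, rt = type(left[0]).__name__, type(right[0]).__name__
    if lt == rt:
        return left + right
    target = SUPERTYPES.get((lt, rt))
    if target is None:
        raise TypeError(f"no supertype for {lt} and {rt}")
    # Cast every value to the common supertype before combining.
    return [target(v) for v in left + right]
```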
Just so i understand
General
The interesting part is that supertyping reduces them all to the same problem.
How often we'd benefit depends on two things:
Examples
I've picked the first example at random.
Note
There are lots of code blocks hidden here!
1 -
This is a big push towards "Narwhals to ensure consistent backend behavior" rather than "Narwhals to let backends do whatever they wish" I am onboard with the reasoning for this PR. However I have 2 concerns:
This comment was marked as outdated.
Sorry for the delay @camriddell! I really do appreciate the time you put into (#3396 (comment)) ❤️ I wanted to circle up with @FBruzzesi first, so we could avoid adding too much to the thread (
Consistency is definitely something I'd like to see more of 1, but understand that everyone has unique expectations on how far that should go.
Thank you 😍
Backwards compatibility
General
I want this feature to preserve backwards compatibility in existing APIs (excluding #3398). Would you like to see something more concrete than the end of my summary here? I have a few ideas.
The main
Yeah, this isn't implemented yet (see usage in #3398). So all together - if we want configuration - I'd be thinking of this as a jumping-off point:
Maintainability
100% on board with the motivation! If there is an appetite for making this configurable (#3396 (comment)), then explaining the tricky bits inline would be my preference (for now). The current impl is under the assumption that valid supertypes are fixed and leans into that pretty heavily.
LOC aside
I feel the need to clear this up, but don't want it to distract from (#3396 (comment)) 🙂
If we go purely by the full diff, okay yes there is a big +1700. However, this covers most of the source LOC changes:
We can reduce the diff by splitting out
IMO, those are pretty good figures for a feature that every backend could use in
camriddell
left a comment
A few minor points & questions on specific code pieces. Nothing high-level.
```python
left_fields, right_fields = left.fields, right.fields
if len(left_fields) != len(right_fields):
    return _struct_fields_union(left_fields, right_fields)
new_fields = deque["Field"]()
```
Why use a deque here? It seems that we're only .appending to the object, so a list should be okay?
Also, is there a reason for the typing syntax on the right-side of the assignment?
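On the second question: since Python 3.9, builtin and `collections` containers support subscription at runtime (PEP 585); calling the resulting `types.GenericAlias` just constructs an ordinary instance. So `deque["Field"]()` is `deque()` carrying an inline annotation, and a `list` would accept the same syntax. A quick demonstration:

```python
from collections import deque

# Subscripting creates a types.GenericAlias; calling it builds a plain
# instance, so the type parameter exists purely for readers and checkers.
d = deque[int]()
d.append(1)

# A list supports the same runtime syntax, if only .append is needed:
xs = list[int]()
xs.append(1)
```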
```python
    left: Collection[Field], right: Collection[Field], /
) -> Struct | None:
    """Adapted from [`union_struct_fields`](https://github.com/pola-rs/polars/blob/c2412600210a21143835c9dfcb0a9182f462b619/crates/polars-core/src/utils/supertype.rs#L559-L586)."""
    longest, shortest = (left, right) if len(left) >= len(right) else (right, left)
```
Minor (feel free to ignore/reject), but perhaps this is a bit more intentful:

```python
shortest, longest = sorted([left, right], key=len)
```
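For what it's worth, the `sorted` one-liner behaves the same for unequal lengths but differs on ties: `sorted` is stable, so equal-length inputs keep `(left, right)` order, whereas the original ternary keeps `left` as `longest` on ties. A quick check:

```python
left, right = [1, 2, 3], [4]
shortest, longest = sorted([left, right], key=len)

# Tie behaviour differs from the ternary: the stable sort keeps input
# order, so on equal lengths `shortest` is `left`, while the original
# assignment keeps `left` as `longest`.
tie_shortest, tie_longest = sorted([["a"], ["b"]], key=len)
```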
```python
for left_f, right_f in zip(left_fields, right_fields):
    if left_f.name != right_f.name:
        return _struct_fields_union(left_fields, right_fields)
    if supertype := get_supertype(left_f.dtype(), right_f.dtype()):
```
does this path do any less work than just always calling `_struct_fields_union`? It feels like the bulk of this entire function could just return `_struct_fields_union`
I had to stare at this for a while again to see it 😂
I'm gonna definitely add some comments, thanks @camriddell
So the other path is optimized for merging both dtype and name differences.
This one bails on the first name mismatch.

> does this path do any less work

If we can avoid the name stuff, simply calling `get_supertype` a bunch can be quite cheap:
- lots of it is `frozenset` ops and `dict` lookups
- complex cases are aggressively cached 😅

Oh, and the other path requires creating and incrementally building up a dict
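As an aside on why those lookups are cheap: a `frozenset` key makes a pair lookup order-insensitive, so one table entry can serve both `(Int32, Int64)` and `(Int64, Int32)`. A minimal sketch (illustrative names, not the PR's tables):

```python
# Illustrative: a frozenset key treats (a, b) and (b, a) identically,
# so a symmetric supertype table needs only one entry per pair.
TABLE = {frozenset({"Int32", "Int64"}): "Int64"}

def lookup(a: str, b: str) -> "str | None":
    # A plain dict .get on a hashable frozenset key: O(1) per call.
    return TABLE.get(frozenset({a, b}))
```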
narwhals/narwhals/dtypes/_supertyping.py (line 206 in dd14fd1)
If we avoid overriding the behavior of a backend and only use this feature in the cases where the backend provides no alternative, then the current plan for limited application (e.g.
My primary concern (this extends beyond this PR, so take with a grain of salt) is that a native backend comes out with a feature that we've already bolted on top in the Narwhals API. What is the future of this feature? Do we continue to route users through our own implementation? Do we adopt the upstream feature with a version check? Will users be surprised if their results differ depending on an interaction between the version of narwhals and the version of their backend?
The above questions don't need to be answered for this PR, but just highlight where my perspective originates :)
Some follow ups from previous discussion
Had some time to take a closer look at the code and I think it's pretty readable (to me)! The main entrypoint is
Point well-taken. The implementation is only a few hundred lines and we do stand to get future re-use out of this.
I think hatches at the call-site would be a bit better than passing messages into/out-of
Description
Important
@FBruzzesi and I have been + are still iterating on this
Core functionality is there, focusing on readability, performance + shrinking the test suite
This PR implements polars' concept of supertyping, which more generally defines which types can be safely promoted/demoted/cast to other types. I really like the DuckDB visualization of their version¹ of these rules, so here's that as an example:
Casting Operations Matrix
This is a preliminary step for implementing relaxed `concat` (#3386). The aim is that we own a consistent set of rules that all/most backends can participate in.
We've already dropped some supertypes that are valid in polars, but may prove challenging in other backends such as #121. Some others are directly mentioned in comments (e.g. `(Struct, DType) -> Struct`).
Additional use-cases
Supertyping in polars is used for much more than just a subset of `concat`. In (#2572), it is one of the larger concepts missing from the intermediate representation (see #3386 (comment)). `polars-plan::plans::conversion::type_coercion` is full of examples of how deeply related the concept is with expressions. My aim is not to reproduce all of that 😅 - but to be able to reason about `DType`s between `LazyFrame` operations without querying the backend for a `Schema` between every step 🤞
Related issues
- `concat(..., how={"vertical_relaxed", "diagonal_relaxed"})` #3386
- `DType.__call__` #3393
- `Decimal` `DType` #3377

Tasks
- `NOTSET` (244537d)
- (443d9ccd)
- `Struct` `DType` fixtures
- `(v1.Datetime, Datetime) -> None`
- `_CACHE_SIZE_TP_HIGH`
- `narwhals.dtypes.classes` -> `narwhals.dtypes._classes` (d2c96fe)
- `narwhals.stable.v1._dtypes` -> `narwhals.dtypes._classes_v1`
- `_has_intersection_first_excluding`
- `Decimal` handling #3377
- `String` downcasts (see thread) 8d9e053
- `(Nested, String)`?
- `{List, Array} -> List` 3ad1639
- `promotion-rules.md` script
- `promotion-rules.md`
- `get_supertype` #3396 (comment)
- `get_supertype` #3396 (comment)
- `_mixed_supertype` comments
- `_numeric_supertype` comments
- `Struct`
- `concat(..., how="*_relaxed")` #3398
- `@lru_cache` on a wrapper for `DType.__eq__`
- `_struct_fields_union` and `get_supertype`
DuckDB also mentions another set of rules called Combination Casting - that is entirely implicit.
The matrix doesn't reflect these and only one cast example is given, but it would apply to `nw.concat`:
"This combination casting occurs for ..., set operations (`UNION`/`EXCEPT`/`INTERSECT`), and ..." ↩