Add Code.Fragment.lines/1 #14493

josevalim · 2025-05-14T07:41:33Z

Inspired by String#lines in Ruby and string.splitlines(True) in Python.

lukaszsamson · 2025-05-14T07:52:15Z

lib/elixir/lib/string.ex

+  defp lines(<<?\n, rest::binary>>, acc),
+    do: [<<acc::binary, ?\n>> | lines(rest, <<>>)]
+
+  defp lines(<<char, rest::binary>>, acc),


Shouldn't we handle /r as well? the python version does:

"asd\radf".splitlines(True) ['asd\r', 'adf']

Side note: LSP document synchronisation treats /r as legit newlines as well

The Ruby does one not and I am skeptical about doing so because it is clearly not a newline on terminals:

$ iex Erlang/OTP 27 [erts-15.1.2] [source] [64-bit] [smp:10:10] [ds:10:10:10] [async-threads:1] [jit] Interactive Elixir (1.19.0-dev) - press Ctrl+C to exit (type h() ENTER for help) iex(1)> IO.puts "foo\rbar" bar :ok

So treating it as a newline looks very Windows centric. :)

Perhaps support the Unicode Line Separator? That would seem consistent with String.split/1 which uses the Unicode definition of whitespace.

So treating it as a newline looks very Windows centric. :)

Wasn't it classic MacOS?

Perhaps support the Unicode Line Separator? That would seem consistent with String.split/1 which uses the Unicode definition of whitespace.

python does support it

"asd adf".splitlines(True) ['asd\u2028', 'adf']

Python has it's own rules https://docs.python.org/3/library/stdtypes.html#str.splitlines basing on https://peps.python.org/pep-0278/ and https://peps.python.org/pep-3116/

And they quite recently added \v and \f to list of line boundaries (in 3.2)

And then you have Erlang's interpretation of newlines, which only considers \r\n and \n as well:

~/OSS/elixir[jv-string-lines *%]$ cat example | elixir -e "IO.stream() |> Enum.each(&IO.inspect/1)" "LINE\n" "LINE\rLINE\n" <<76, 73, 78, 69, 12, 76, 73, 78, 69, 11, 76, 73, 78, 69, 194, 133, 76, 73, 78, 69, 226, 128, 168, 76, 73, 78, 69, 226, 128, 169, 76, 73, 78, 69>>

So I am thinking the best is to find a new home for this function. Perhaps the Code module indeed.

I still use Textmate often and it shows something very similar to Zed.

That's even more conservative as it only shows \n but I assume there is a configuration somewhere for it to read it Windows style.

Thank you for the feedback. I have renamed it to Code.lines/1, allowing us to mirror what the Elixir compiler does.

sabiwara · 2025-05-20T07:13:45Z

lib/elixir/lib/code.ex

+  considered, namely `\r\n` and `\n`. If you would like the retrieve
+  lines without their line endings, use `String.split(string, ["\r\n", "\n"])`.


It feels like the disclaimer about ending will narrow the use case for this quite a bit?
Perhaps we could add a trim: true option later to remove the ending, in which case it would make it more generally useful.
Which kind of use cases do we have in mind for this?

The use case is to fetch a range from a source file but preserving its original source code. I can add trim: true but trim typically means trimming more (spaces, tabs, etc). It is also something folks can do by doing another pass on the data.

josevalim · 2025-05-22T13:40:59Z

I am thinking about moving this to Code.Fragment instead... as Code is quite bloated already and it is usually about evaluating code (not treating Code as text).

josevalim · 2025-05-28T09:04:34Z

💚 💙 💜 💛 ❤️

josevalim added 2 commits May 14, 2025 09:39

Add String.lines/1

8ffaf5a

Private

17e41d0

lukaszsamson reviewed May 14, 2025

View reviewed changes

Move to Code.lines/1

c7c21bf

josevalim changed the title ~~Add String.lines/1~~ Add Code.lines/1 May 17, 2025

sabiwara reviewed May 20, 2025

View reviewed changes

Move to Code.Fragment

f70f2ad

josevalim changed the title ~~Add Code.lines/1~~ Add Code.Fragment.lines/1 May 28, 2025

josevalim merged commit 7f1fe47 into main May 28, 2025
24 checks passed

josevalim deleted the jv-string-lines branch May 28, 2025 09:04

Nezteb pushed a commit to Nezteb/elixir that referenced this pull request Jun 2, 2025

Add Code.Fragment.lines/1 (elixir-lang#14493)

b63993c

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Add Code.Fragment.lines/1 #14493

Add Code.Fragment.lines/1 #14493

Uh oh!

josevalim commented May 14, 2025

Uh oh!

lukaszsamson May 14, 2025

Uh oh!

josevalim May 14, 2025

Uh oh!

kipcole9 May 14, 2025

Uh oh!

lukaszsamson May 14, 2025

Uh oh!

lukaszsamson May 14, 2025

Uh oh!

lukaszsamson May 14, 2025 •

edited

Loading

Uh oh!

josevalim May 14, 2025

Uh oh!

kipcole9 May 14, 2025

Uh oh!

josevalim May 14, 2025

Uh oh!

josevalim May 17, 2025

Uh oh!

sabiwara May 20, 2025

Uh oh!

josevalim May 20, 2025

Uh oh!

josevalim commented May 22, 2025

Uh oh!

Uh oh!

josevalim commented May 28, 2025

Uh oh!

Uh oh!

		considered, namely `\r\n` and `\n`. If you would like the retrieve
		lines without their line endings, use `String.split(string, ["\r\n", "\n"])`.

Add Code.Fragment.lines/1 #14493

Add Code.Fragment.lines/1 #14493

Uh oh!

Conversation

josevalim commented May 14, 2025

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

lukaszsamson May 14, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

josevalim commented May 22, 2025

Uh oh!

Uh oh!

josevalim commented May 28, 2025

Uh oh!

Uh oh!

lukaszsamson May 14, 2025 •

edited

Loading