Understanding String Slicing in #Rust

Jesús Flores

Senior Software Engineer at Factor Eleven | Video Game DM at Stone Goblin Games

Published: September 9, 2024

One important thing to remember when slicing strings in #Rust is that slice ranges are byte offsets, not character indices. This distinction means that slicing a pure ASCII string is not the same as slicing a string that contains multibyte (non-ASCII) characters. Here's an example of code that compiles but panics at runtime because of this difference:

fn main() {
    let ascii_string = "foobar";
    let multibyte_string = "España";

    let length_in_bytes_ascii = ascii_string.len();
    let length_in_bytes_multibyte = multibyte_string.len();

    // Slicing strings
    let slice_ascii = &ascii_string[..3]; // This works fine
    let slice_multibyte = &multibyte_string[..5]; // This will panic at runtime

    println!("The length of the ASCII string in bytes is: {}", length_in_bytes_ascii);
    println!("The length of the multibyte string in bytes is: {}", length_in_bytes_multibyte);

    println!("The ASCII slice is: {}", slice_ascii);
    println!("The multibyte slice is: {}", slice_multibyte);
}

In the above code:

ascii_string is a simple ASCII string where each character is 1 byte.
multibyte_string contains a non-ASCII character, and some characters (like "ñ") take more than 1 byte in UTF-8.

When we try to slice multibyte_string using a range that doesn't align with character boundaries (like &multibyte_string[..5]), Rust panics at runtime. The "ñ" occupies bytes 4 and 5, so byte index 5 falls in the middle of that character. A &str must always be valid UTF-8, and a slice ending there would not be.
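To see the boundary problem without triggering a panic, here is a minimal sketch using the standard library's str::is_char_boundary and str::get. The helper name safe_prefix is hypothetical and only there for illustration; it assumes the string is "España", with "ñ" encoded as two bytes.

```rust
/// Hypothetical helper (not from the original article): returns the prefix
/// ending at byte index `end`, or None when `end` is out of range or does
/// not fall on a UTF-8 character boundary.
fn safe_prefix(s: &str, end: usize) -> Option<&str> {
    // str::get performs the same checks as indexing, but returns None
    // instead of panicking.
    s.get(..end)
}

fn main() {
    let multibyte_string = "España";

    // "ñ" is encoded as two bytes (0xC3 0xB1) at byte indices 4 and 5.
    println!("{}", multibyte_string.is_char_boundary(4)); // true
    println!("{}", multibyte_string.is_char_boundary(5)); // false

    println!("{:?}", safe_prefix(multibyte_string, 5)); // None
    println!("{:?}", safe_prefix(multibyte_string, 6)); // Some("Españ")
}
```

Returning an Option pushes the decision about an invalid index to the caller, which is often preferable to a panic when the index comes from user input.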

Lesson: Always ensure your slices align with valid UTF-8 character boundaries when working with multibyte strings in Rust!
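One way to follow this lesson when you want "the first n characters" rather than "the first n bytes" is to let char_indices find the byte offset for you. This is only a sketch: first_n_chars is a hypothetical helper, and "character" here means a Unicode scalar value (a Rust char), not a user-perceived grapheme cluster.

```rust
/// Hypothetical helper (not from the original article): returns the first
/// `n` chars of `s`, or the whole string if it has `n` chars or fewer.
fn first_n_chars(s: &str, n: usize) -> &str {
    // char_indices yields (byte_offset, char) pairs, so the offset of the
    // n-th char is always a valid boundary to slice at.
    match s.char_indices().nth(n) {
        Some((byte_index, _)) => &s[..byte_index],
        None => s,
    }
}

fn main() {
    println!("{}", first_n_chars("España", 5)); // Españ
    println!("{}", first_n_chars("foobar", 3)); // foo
}
```

Because the byte offset comes from the iterator itself, the slice inside the helper cannot land in the middle of a multibyte character.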
