unicode_data

Creator: coderz1093

Last updated: October 3, 2024

Languages

Dart

Description:

unicode data

Unicode Data
This library puts Unicode data in a format that can be manipulated programmatically. The current implementation includes Unicode block and script data.
Background
Unicode code points are divided into blocks that generally contain characters from the same or related writing systems, for example Basic Latin or Arabic. However, the complete character set needed for a writing system is often spread across several blocks. This character set is referred to as a script. If you want to know which writing system a particular character belongs to, it is generally more accurate to use the Unicode script data rather than the block data. You can read more about the difference here.
This library contains classes for Unicode scripts and blocks. It was generated from the Unicode 12.0 Scripts.txt and Blocks.txt data files. This library is exhaustive in that it includes every script and block in those data files.
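To make the block/script distinction concrete, here is a minimal, self-contained sketch. It does not use this package: `NamedRange`, `nameOf`, and the two small tables are hypothetical stand-ins, with a handful of ranges copied by hand from Blocks.txt and Scripts.txt. It shows three Latin letters that fall in three different blocks but share one script.

```dart
/// Hypothetical stand-in for a named code point range (not the package's class).
class NamedRange {
  const NamedRange(this.name, this.start, this.end);
  final String name;
  final int start;
  final int end;

  bool contains(int codePoint) => codePoint >= start && codePoint <= end;
}

// A few block ranges, copied by hand from Blocks.txt.
const blocks = [
  NamedRange('Basic Latin', 0x0000, 0x007F),
  NamedRange('Latin-1 Supplement', 0x0080, 0x00FF),
  NamedRange('Latin Extended-A', 0x0100, 0x017F),
];

// A few Latin script ranges, copied by hand from Scripts.txt.
const scripts = [
  NamedRange('Latin', 0x0061, 0x007A),
  NamedRange('Latin', 0x00D8, 0x00F6),
  NamedRange('Latin', 0x00F8, 0x01BA),
];

/// Returns the name of the first range containing [codePoint], or null.
String? nameOf(List<NamedRange> ranges, int codePoint) {
  for (final range in ranges) {
    if (range.contains(codePoint)) return range.name;
  }
  return null;
}

void main() {
  // 'a', U+00E9 (e with acute) and U+0101 (a with macron).
  for (final char in ['a', '\u00E9', '\u0101']) {
    final codePoint = char.runes.single;
    print('$char: block=${nameOf(blocks, codePoint)}, '
        'script=${nameOf(scripts, codePoint)}');
  }
}
```

Each character reports a different block name but the same script name, Latin, which is why script data is usually the better signal for identifying a writing system.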
Usage
A simple usage example:
import 'package:unicode_data/unicode_data.dart';

void main() {
  unicodeBlockExamples();
  unicodeScriptExamples();
}

// Unicode Blocks
void unicodeBlockExamples() {
  // get a list of all blocks
  List<Block> blocks = UnicodeBlock.blocks;

  // find the block name for a code point
  final codePoint = 'a'.runes.single;
  final found = blocks
      .where((block) => codePoint >= block.start && codePoint <= block.end);
  final blockName = found.single.name; // Basic Latin

  // get the range for a specific block name
  final block = blocks.where((block) => block.name == 'Mongolian').single;
  final rangeStart = block.start; // 6144
  final rangeEnd = block.end; // 6319
}

// Unicode Scripts
void unicodeScriptExamples() {
  // get a list of all scripts
  List<Script> scripts = UnicodeScript.scripts;

  // find the script name and category for a code point
  final codePoint = 'a'.runes.single;
  final found = scripts.where(
      (script) => codePoint >= script.start && codePoint <= script.end);
  final script = found.single;
  final name = script.name; // Latin
  final category = script.category; // L&

  // find all script ranges for Latin
  final latinScripts = scripts.where((script) => script.name == 'Latin');

  // find all script ranges that are punctuation
  final punctRanges =
      scripts.where((script) => script.category.startsWith('P'));
}
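The category describes the type of character in a range, such as a letter, punctuation, or some other type. In the example above, L& is a letter category and every category that starts with P is a kind of punctuation.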
Contributing
Your help and pull requests are welcome.

When there is a new Unicode version, the code should be regenerated from the data files. See the generators folder in the source code.
Because the lists contain so many ranges, querying them with a linear scan can be expensive. I would appreciate advice or examples on how to do this more efficiently, and I am open to using a different data structure.
There are other types of Unicode data that could be included in this library in the future. You can open an issue if you have a request.
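As one possible answer to the lookup-cost question above, the sketch below uses a binary search instead of a linear scan. It is only an illustration under stated assumptions: `CodeRange`, `sortedByStart`, and `findRange` are hypothetical names, not part of this library, and the approach assumes the ranges do not overlap. Because Scripts.txt groups its ranges by script rather than by code point, the list is sorted by start before searching.

```dart
/// Hypothetical range type with name/start/end fields (not the package's class).
class CodeRange {
  const CodeRange(this.name, this.start, this.end);
  final String name;
  final int start;
  final int end;
}

/// Returns a copy of [ranges] sorted by start; binary search needs this order.
List<CodeRange> sortedByStart(Iterable<CodeRange> ranges) =>
    ranges.toList()..sort((a, b) => a.start.compareTo(b.start));

/// Binary search over sorted, non-overlapping ranges.
/// Returns the range containing [codePoint], or null if none does.
CodeRange? findRange(List<CodeRange> sorted, int codePoint) {
  var low = 0;
  var high = sorted.length - 1;
  while (low <= high) {
    final mid = low + ((high - low) >> 1);
    final range = sorted[mid];
    if (codePoint < range.start) {
      high = mid - 1; // target lies before this range
    } else if (codePoint > range.end) {
      low = mid + 1; // target lies after this range
    } else {
      return range;
    }
  }
  return null;
}

void main() {
  // Sample data only: three block ranges, deliberately out of order.
  final ranges = sortedByStart(const [
    CodeRange('Mongolian', 0x1800, 0x18AF),
    CodeRange('Basic Latin', 0x0000, 0x007F),
    CodeRange('Arabic', 0x0600, 0x06FF),
  ]);

  print(findRange(ranges, 'a'.runes.single)?.name); // Basic Latin
  print(findRange(ranges, 0x1820)?.name); // Mongolian
  print(findRange(ranges, 0x0400)?.name); // null (not in the sample data)
}
```

Sorting once costs O(n log n), after which each lookup is O(log n) rather than O(n). Returning null also avoids the exception that `single` throws when a code point matches no range.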

License

For personal and professional use. You cannot resell or redistribute these repositories in their original state.
