aboutsummaryrefslogtreecommitdiffstats
path: root/src/coding.c (follow)
Commit message (Collapse)AuthorAgeFilesLines
* Commentary changes.Dave Love2001-01-041-64/+68
| | | | (detect_eol_type_in_2_octet_form): Declare arg big_endian_p.
* (detect_coding_iso2022, setup_coding_system): RemoveEli Zaretskii2001-01-021-14/+5
| | | | | | | | | | | | | | | unused variable `i'. (detect_coding_mask): Remove unused variable `idx'. (detect_coding): Remove unused variable `i'. (ccl_coding_driver): Remove unused variable `result'. (run_pre_post_conversion_on_str): Remove unused variable `prev'. (decode_coding_string): Remove unused variables `to' and `gcpro1'. (encode_coding_string): Remove unused variables `gcpro1' and `saved_coding_symbol'. (Ffind_coding_systems_region_internal): Remove function-local variable args[], leave only the block-local one. (code_convert_region1): Remove unused variable `len'.
* (setup_coding_system): InitializeKenichi Handa2000-12-281-2/+56
| | | | | coding->spec.ccl.eight_bit_carryover. (ccl_coding_driver): Pay attention to carried over 8-bit bytes.
* (SAFE_ONE_MORE_BYTE): New macro.Kenichi Handa2000-12-281-36/+439
| | | | | | | | | | | | | (DECODE_EMACS_MULE_COMPOSITION_CHAR): New macro. (DECODE_EMACS_MULE_COMPOSITION_RULE): New macro. (decode_composition_emacs_mule): New function. (decode_coding_emacs_mule): Decode composition sequence by calling decode_composition_emacs_mule. (ENCODE_COMPOSITION_EMACS_MULE): New macro. (encode_coding_emacs_mule): Changed from macro to function. If a text contains compostions, encode them correctly. (setup_coding_system): Set coding->commong_flags for emacs-mule so that decoding and encoding are required.
* (ccl_coding_driver): Initialize ccl->cr_consumed.Kenichi Handa2000-12-211-2/+12
|
* (detect_coding): Call detect_coding_mask with a correct MULTIBYTEPKenichi Handa2000-12-151-4/+2
| | | | | | argument. (code_convert_region): Don't override coding->src_multibyte and coding->dst_multibyte.
* (setup_coding_system): Clear all members of the structKenichi Handa2000-12-151-2/+3
| | | | coding_system at first.
* (decode_coding_string): Set coding->src_multibyte andKenichi Handa2000-12-141-3/+7
| | | | | coding->dst_multibyte before calling detect_coding and detect_eol. Update them after some coding system is detected.
* (ONE_MORE_BYTE_CHECK_MULTIBYTE): New macro.Kenichi Handa2000-12-131-53/+87
| | | | | | | | | (detect_coding_emacs_mule, detect_coding_iso2022,) (detect_coding_sjis, detect_coding_big5, detect_coding_utf_8) (detect_coding_utf_16, detect_coding_ccl): Make them static. New argument MULTIBYTEP. Callers changed. (detect_coding_mask, detect_coding_system): New argument MULTIBYTEP. Callers changed.
* (setup_coding_system): Be sure to initialize coding->category_idx.Kenichi Handa2000-12-051-0/+6
|
* (Ffind_coding_systems_region_internal): Be sure toKenichi Handa2000-11-281-1/+3
| | | | include no-conversion.
* (decode_coding_emacs_mule): Fix the case ofEli Zaretskii2000-11-231-2/+1
| | | | CODING_EOL_LF, which used an uninitialized value of c.
* (decode_coding): Fix previous change (check alsoKenichi Handa2000-10-261-0/+1
| | | | CODING_MODE_LAST_BLOCK).
* (decode_coding_emacs_mule): If coding->eol_type is CREli Zaretskii2000-10-241-1/+40
| | | | or CRLF, decode EOLs.
* Fix a typo in a comment.Eli Zaretskii2000-10-221-1/+1
|
* (decode_coding_string): Set coding->src_multibyte andKenichi Handa2000-10-131-8/+8
| | | | | | coding->dst_multibyte before using CODING_REQUIRE_DECODING. (encode_coding_string): Set coding->src_multibyte and coding->dst_multibyte before using CODING_REQUIRE_ENCODING.
* (code_convert_region): Be sure to initialize coding->category_idx.Kenichi Handa2000-10-131-5/+8
|
* (detect_coding_sjis): Check the byte sequence more regidly.Kenichi Handa2000-09-071-3/+8
|
* (encode_coding_sjis_big5): Fix previous change.Kenichi Handa2000-09-071-0/+2
|
* (encode_coding_sjis_big5): Use translation table forKenichi Handa2000-09-071-5/+5
| | | | | | encoding, not decoding. Check for the charset katakana-jisx0201, not latin-jisx0201. (ONE_MORE_CHAR): Call translate_char with CHARSET arg -1.
* (encode_eol): Fix bug for the case of dst_bytes being zero. SetKenichi Handa2000-08-231-1/+2
| | | | coding->produced_char correctly.
* (encode_coding_string): Fix previous change.Kenichi Handa2000-08-181-0/+1
|
* (decode_coding_string): Set members consumed,Kenichi Handa2000-08-181-8/+27
| | | | | | consumed_char, produced, produced_char of *coding correctly. If decode_coding doesn't consume any byte, don't try anymore. (encode_coding_string): Likewise.
* (encode_coding): Fix the bug of not flushing ISO escape sequence atKenichi Handa2000-08-161-5/+5
| | | | the end of the source block.
* (MINIMUM_CONVERSION_BUFFER_SIZE): Macro deleted.Kenichi Handa2000-08-161-80/+150
| | | | | | | | | | | | | | | | | | | | | | (conversion_buffer, conversion_buffer_size): Variables deleted. (get_conversion_buffer): Function deleted. (struct conversion_buffer): New structure. (MAX_ALLOCA): New macro. (allocate_conversion_buffer): New macro. (extend_conversion_buffer, free_conversion_buffer): New functions. (ccl_coding_driver): Set coding->result. (decode_coding): Set coding->result to CODING_FINISH_NORMAL if this is the last block of source. (encode_coding): Likewise. Handle the source block as the last one only when the whole source text is consumed. (decode_coding_string): Handle the case that the output buffer is too small to decode the whole source text. Use allocate_conversion_buffer, extend_conversion_buffer and free_conversion_buffer, not get_conversion_buffer. (encode_coding_string): Likewise. (init_coding): Function deleted. (init_coding_once): Delete code to initialize conversion_buffer_size.
* (ccl_coding_driver): Initialize ccl->multibyte.Kenichi Handa2000-08-111-0/+1
|
* *coding.c (find_safe_codings): CHAR_TABLE_SET index must be a lisp object.Ken Raeburn2000-08-081-2/+2
| | | | | (Ffind_coding_systems_region_internal): First argument to Fappend must be an integer, not a lisp object.
* (coding_restore_composition): Pay attention to the case thatKenichi Handa2000-08-041-20/+32
| | | | | | | | cmp_data is not set properly (because of invalid code in the source text). (run_pre_post_conversion_on_str): Include text properties in the resulting string. (decode_coding_string): Set members of coding correctly.
* (decode_coding_iso2022): More strict check for handling singleKenichi Handa2000-08-041-0/+4
| | | | shifting.
* (Qsafe_charsets): This variable deleted.Kenichi Handa2000-07-271-87/+267
| | | | | | | | | | | | | | | | | | | | | | | | | | | (Qsafe_chars, Vchar_coding_system_table, Qchar_coding_system): New variables. (coding_safe_chars): New function. (CODING_SAFE_CHAR_P): New macro. (CHARSET_OK): New arg C. Call CODING_SAFE_CHAR_P instead of checking safe_charsets member of the coding system. Caller changed. (detect_coding_iso2022): New local variable safe_chars. (DECODE_DESIGNATION): Call CODING_SAFE_CHAR_P instead of checking safe_charsets member of the coding system. (decode_coding_iso2022): New local variable safe_chars. (ENCODE_ISO_CHARACTER_DIMENSION1): Don't check unsafe chars here. (ENCODE_ISO_CHARACTER_DIMENSION2): Likewise. (ENCODE_ISO_CHARACTER): Arguments changed. Caller changed. (ENCODE_UNSAFE_CHARACTER): New macro. (encode_coding_iso2022): New local variable safe_chars. Check unsafe chars. (setup_coding_system): Delete the code to initialize coding->safe_charses (intersection, find_safe_codings): New functions. (Ffind_coding_systems_region_internal): New function. (syms_of_coding): Defsubr it. Initialize Qsafe_chars, Qsafe_cding_system. Make Vchar_coding_system_table a Lisp variable and initialize it.
* (syms_of_coding): Doc fix for inhibit-iso-escape-detection.Eli Zaretskii2000-07-231-7/+7
|
* (code_convert_region): Delete text properties before shrinking theKenichi Handa2000-07-191-11/+9
| | | | conversion region.
* (syms_of_coding): Fix typo in spelling of variableGerd Moellmann2000-07-181-1/+1
| | | | `inhibit-iso-escape-detection'.
* (setup_coding_system): Don't override the explicitly specifiedKenichi Handa2000-07-181-2/+4
| | | | designations.
* (inhibit_iso_escape_detection): New variable.Kenichi Handa2000-07-141-0/+37
| | | | | | (syms_of_coding): Make it a Lisp variable. (detect_coding_iso2022): If inhibit_iso_escape_detection is nonzero, ignore ISO2022's escape sequence.
* (code_convert_region): Even if the length of text isKenichi Handa2000-06-301-38/+41
| | | | | zero, try to convert it if coding->type is coding_type_ccl. (decode_coding_string, encode_coding_string): Likewise.
* (decode_coding_string): Re-fetch STRING_BYTES afterAndreas Schwab2000-06-261-0/+1
| | | | Fstring_as_unibyte.
* (decode_eol_post_ccl): Special handling for undecidedEli Zaretskii2000-06-231-4/+10
| | | | and inconsistent EOL types.
* (setup_coding_system) <4>: Reset member `cr_carryover'.Kenichi Handa2000-06-191-5/+152
| | | | | | | | | | | | (ccl_coding_driver): On encoding, initialize ccl->eol_type. (decode_eol_post_ccl): New function. (decode_coding): Don't detect EOL format here for CCL based coding systems. (decode_coding) <coding_type_ccl>: Handle carryovered CR. Call decode_eol_post_ccl after running the CCL program. (code_convert_region): Don't detect EOL format here for CCL based coding systems. (decode_coding_string): Likewise.
* (decode_coding_emacs_mule): Always set src_base at theKenichi Handa2000-06-151-2/+1
| | | | start of the while loop.
* (detect_coding_iso2022): Fix code for checkingKenichi Handa2000-05-301-3/+1
| | | | CODING_CATEGORY_MASK_ISO_8_2.
* (DECODE_COMPOSITION_START): If coding->cmp_data is notKenichi Handa2000-05-291-45/+46
| | | | | | | yet allocated, finish decoding with result CODING_FINISH_INSUFFICIENT_CMP. (coding_allocate_composition_data): Make it non-static. (coding_restore_composition): Likewise.
* (shrink_decoding_region): Initialize eol_conversion.Dave Love2000-05-261-1/+4
|
* Fix junk from last checkin.Dave Love2000-05-251-1/+1
|
* (encode_eol): Add null statement after label.Dave Love2000-05-251-0/+3
|
* (run_pre_post_conversion_on_str): Set point to theKenichi Handa2000-05-251-3/+6
| | | | | | | beginning of buffer before calling coding->post_read_conversion. (decode_coding_string): Give correct args to run_pre_post_conversion_on_str. (encode_coding_string): Likewise.
* (encode_eol): Fix a bug of DOS style EOL encoding.Kenichi Handa2000-05-231-0/+1
|
* (ONE_MORE_BYTE, TWO_MORE_BYTES): Set coding->resutl toKenichi Handa2000-05-191-1363/+1220
| | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | CODING_FINISH_INSUFFICIENT_SRC if there's not enough source. (ONE_MORE_CHAR, EMIT_CHAR, EMIT_ONE_BYTE, EMIT_TWO_BYTE, EMIT_BYTES): New macros. (THREE_MORE_BYTES, DECODE_CHARACTER_ASCII, DECODE_CHARACTER_DIMENSION1, DECODE_CHARACTER_DIMENSION2): These macros deleted. (CHECK_CODE_RANGE_A0_FF): This macro deleted. (detect_coding_emacs_mule): Use UNIBYTE_STR_AS_MULTIBYTE_P to check the validity of multibyte sequence. (decode_coding_emacs_mule): New function. (encode_coding_emacs_mule): New macro. (detect_coding_iso2022): Use ONE_MORE_BYTE to fetch a byte from the source. (DECODE_ISO_CHARACTER): Just return a character code. (DECODE_COMPOSITION_START): Set coding->result instead of result. (decode_coding_iso2022, decode_coding_sjis_big5, decode_eol): Use EMIT_CHAR to produced decoded characters. Exit the loop only by macros ONE_MORE_BYTE or EMIT_CHAR. Don't handle the case of last block here. (ENCODE_ISO_CHARACTER): Don't translate character here. Produce only position codes for an invalid character. (encode_designation_at_bol): Return new destination pointer. 5th arg DSTP is changed to DST. (encode_coding_iso2022, decode_coding_sjis_big5): Get a character from the source by ONE_MORE_CHAR. Don't handle the case of last block here. (DECODE_SJIS_BIG5_CHARACTER, ENCODE_SJIS_BIG5_CHARACTER): These macros deleted. (detect_coding_sjis, detect_coding_big5, detect_coding_utf_8, detect_coding_utf_16, detect_coding_ccl): Use ONE_MORE_BYTE and TWO_MORE_BYTES to fetch a byte from the source. (encode_eol): Pay attention to coding->src_multibyte. (detect_coding, detect_eol): Preserve members src_multibyte and dst_multibyte. (DECODING_BUFFER_MAG): Return 2 even for coding_type_raw_text. (encoding_buffer_size): Set magnification to 3 for all coding systems that require encoding. (ccl_coding_driver): For decoding, be sure that the result is valid multibyte sequence. (decode_coding): Initialize coding->errors and coding->result. For emacs-mule, call decode_coding_emacs_mule. For no-conversion and raw-text, always call decode_eol. Handle the case of last block here. If not coding->dst_multibyte, convert the resulting sequence to unibyte. (encode_coding): Initialize coding->errors and coding->result. For emacs-mule, call encode_coding_emacs_mule. For no-conversion and raw-text, always call encode_eol. Handle the case of last block here. (shrink_decoding_region, shrink_encoding_region): Detect cases that we can't skip data more rigidly. (code_convert_region): Setup src_multibyte and dst_multibyte members of coding. For decoding, if the buffer is multibyte, convert the source sequence to unibyte in advance. For encoding, if the buffer is multibyte, convert the resulting sequence to multibyte afterward. (run_pre_post_conversion_on_str): New function. (code_convert_string): Deleted and divided into the following two. (decode_coding_string, encode_coding_string): New functions. (code_convert_string1, code_convert_string_norecord): Call one of above. (Fdecode_sjis_char, Fdecode_big5_char): Use MAKE_CHAR instead of MAKE_NON_ASCII_CHAR. (Fset_terminal_coding_system_internal, Fset_safe_terminal_coding_system_internal): Setup src_multibyte and dst_multibyte members. (init_coding_once): Initialize iso_code_class with new enum ISO_control_0 and ISO_control_1.
* * coding.h (code_convert_string): Declare.Ken Raeburn2000-04-081-1/+1
| | | | | * coding.c (code_convert_string_norecord): Pass an int, not a lisp object, as the fourth argument to code_convert_string.
* Add comments on coding-category-utf-8,Kenichi Handa2000-03-071-41/+258
| | | | | | | | | | | | | | | | | | | | | | coding-category-utf-16-be, and coding-category-utf-16-le. (coding_category_name): Include "coding-category-utf-8", "coding-category-utf-16-be", and "coding-category-utf-16-le". (UTF_8_1_OCTET_P) (UTF_8_EXTRA_OCTET_P) (UTF_8_2_OCTET_LEADING_P) (UTF_8_3_OCTET_LEADING_P) (UTF_8_4_OCTET_LEADING_P) (UTF_8_5_OCTET_LEADING_P) (UTF_8_6_OCTET_LEADING_P): New macros. (detect_coding_utf_8): New function. (UTF_16_INVALID_P) (TF_16_HIGH_SURROGATE_P) (UTF_16_LOW_SURROGATE_P): New macros. (detect_coding_utf_16): New function. (detect_coding_mask): Fix bug of returning wrong mask bits in the case that detect_coding_XXX returns a mask not set in priorities[i]. (detect_eol_type_in_2_octet_form): New function. (detect_eol): If cooding->category_idx is for UTF-16, call detect_eol_type_in_2_octet_form instead of dectect_eol_type. (detect_coding_system): Don't include `nil' coding-system in the result. (Fupdate_coding_systems_internal): Update all coding-categories.