Program suffsort version 0.1.6

Initial revision 2003-02-13; Last revision 2004-05-31

1  Download
2  File readme
3  Usage and options summary
4  Description
5  Project revision history
6  License

1  Download

Sources: src/suffsort-0.1.6.tgz [33 KB]

Win9x EXE (MinGW cross-compiled): mingw/suffsort.zip [20 KB]

2  File readme

suffsort --- construction of a suffix array for a given file


SUPPORTED ENVIRONMENTS

http://www.gnu.org    GNU/Linux 
http://www.mingw.org  MinGW --- Minimalist GNU For Windows


COMPILATION

Run make (or gmake) in the directory that contains the sources.


BRIEF INSTRUCTION

This program constructs the suffix array of a file. If you call

suffsort file.txt

then the suffix array is written to file.txt.ary, where it can be
used further, for example by cross-entropy.
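
For readers unfamiliar with the term, a suffix array lists the start
offsets of all suffixes of the input in lexicographic order. The Python
sketch below illustrates the idea on an in-memory byte string. It is
not the suffsort implementation; the function name naive_suffix_array
is hypothetical, and 0-based offsets with unsigned byte-wise comparison
are assumptions made for the illustration.

```python
def naive_suffix_array(data: bytes) -> list[int]:
    """Return the start offsets of all suffixes of data in sorted order.

    Assumptions (not taken from suffsort): offsets are 0-based and
    suffixes are compared byte by byte as unsigned values.
    """
    # Sorting by the suffix itself copies every suffix, so this is only
    # suitable for small inputs.
    return sorted(range(len(data)), key=lambda i: data[i:])


if __name__ == "__main__":
    text = b"banana"
    for offset in naive_suffix_array(text):
        # Print each offset next to the suffix that starts there.
        print(offset, text[offset:].decode())
```

For b"banana" the sorted suffixes are a, ana, anana, banana, na, nana,
so the resulting array is [5, 3, 1, 0, 4, 2].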

The license conditions are described in the file LICENSE.txt.


3  Usage and options summary

user@computer$ ./suffsort --help
Usage: suffsort [OPTION]... FILE
Output file: FILE.ary
  -n, --naive               naive sort
  -l, --larsson-sadakane    Larsson-Sadakane
  -r, --resilence-naive     resilence naive sort (default)

  -d, --digital             output suffix array as a digital number per line

  -q, --quiet               do not send any messages to stderr
  -h, --help                display this help and exit
  -m, --man                 display complete description
  -v, --version             display version and exit
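
With -d the array is written as one number per line, which makes it
easy to inspect from a script. The Python sketch below parses such text
and checks that it really is a suffix array of the given data. It is a
rough illustration under assumptions: the numbers are taken to be
decimal, 0-based offsets, and suffixes are compared as unsigned bytes;
none of this is confirmed by the help text. The function names are
hypothetical.

```python
def parse_decimal_array(text: str) -> list[int]:
    """Parse one integer per non-empty line (assumed -d output layout)."""
    return [int(line) for line in text.splitlines() if line.strip()]


def is_suffix_array(data: bytes, sa: list[int]) -> bool:
    """Check that sa lists every offset of data once, in suffix order.

    Assumes 0-based offsets and unsigned byte-wise comparison.
    """
    # Every offset 0..len(data)-1 must appear exactly once.
    if sorted(sa) != list(range(len(data))):
        return False
    # Neighbouring suffixes must be in strictly increasing order.
    return all(data[a:] < data[b:] for a, b in zip(sa, sa[1:]))


if __name__ == "__main__":
    listing = "5\n3\n1\n0\n4\n2\n"
    print(is_suffix_array(b"banana", parse_decimal_array(listing)))
```

If the real output turns out to use 1-based offsets, subtract one from
each parsed value before checking.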


4  Description

user@computer$ ./suffsort --man
<Usage information from the previous section is omitted>


We use the Larsson-Sadakane suffix sorting algorithm described in
"Faster Suffix Sorting" by N. Jesper Larsson (jesper@cs.lth.se) and
Kunihiko Sadakane (sada@is.s.u-tokyo.ac.jp). It requires 9*size(FILE)
memory. Memory requirements can be reduced with the switches -n (or -r),
which select the naive (or resilence naive) suffix sort based on the
system qsort function. In that case memory requirements drop to
5*size(FILE), at the cost of a slowdown by a factor of 4. However, the
system qsort may itself require a lot of memory, in particular on the
stack, which might lead to errors in sorting.
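
To make the trade-off concrete, the Python sketch below turns the two
factors quoted above into a rough estimate for a given file size. The
table and the function name estimated_memory are hypothetical helpers,
and reading size(FILE) as a size in bytes is an assumption. The
estimate does not include any extra memory or stack that the system
qsort may use.

```python
# Memory factors as quoted in the description above.
MEMORY_FACTOR = {
    "larsson-sadakane": 9,   # -l
    "naive": 5,              # -n
    "resilence-naive": 5,    # -r
}


def estimated_memory(file_size: int, method: str) -> int:
    """Return factor * file_size for the chosen method.

    file_size is assumed to be in bytes; the result uses the same unit.
    """
    return MEMORY_FACTOR[method] * file_size


if __name__ == "__main__":
    size = 100 * 2**20  # a 100 MiB input file
    for method in MEMORY_FACTOR:
        mib = estimated_memory(size, method) / 2**20
        print(f"{method}: about {mib:.0f} MiB")
```

For a 100 MiB file this gives about 900 MiB for Larsson-Sadakane and
about 500 MiB for either naive variant, the latter at roughly four
times the running time according to the description.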


5  Project revision history

Files of the project were modified on the following dates:

2003-02-13

2003-02-18

2003-02-19

2003-05-15

2003-05-16

2004-04-10

2004-05-31

6  License

suffsort

Available at http://www.math.toronto.edu/dkhmelev/PROGS/tacu/

Author:

Dmitry V. Khmelev dkhmelev((at))math.toronto.edu [change ((at)) to @ in order to get proper address - antispam]

University of Toronto, Department of Mathematics, 100 St George Street, M5S 3G3 ON, Canada

LICENSING TERMS

This program is granted free of charge for research and education purposes. However you must obtain a license from the author to use it for commercial purposes.

Scientific results produced using the software provided shall acknowledge the use of suffsort. The proper reference is:

D. Khmelev, Text Analysis and Conversion Utilities http://www.math.toronto.edu/dkhmelev/PROGS/tacu/

Moreover, the author of suffsort shall be informed about the publication.

The software must not be modified and distributed without prior permission of the author.

By using suffsort you agree to the licensing terms.

NO WARRANTY

BECAUSE THE PROGRAM IS LICENSED FREE OF CHARGE, THERE IS NO WARRANTY FOR THE PROGRAM, TO THE EXTENT PERMITTED BY APPLICABLE LAW. EXCEPT WHEN OTHERWISE STATED IN WRITING THE COPYRIGHT HOLDERS AND/OR OTHER PARTIES PROVIDE THE PROGRAM "AS IS" WITHOUT WARRANTY OF ANY KIND, EITHER EXPRESSED OR IMPLIED, INCLUDING, BUT NOT LIMITED TO, THE IMPLIED WARRANTIES OF MERCHANTABILITY AND FITNESS FOR A PARTICULAR PURPOSE. THE ENTIRE RISK AS TO THE QUALITY AND PERFORMANCE OF THE PROGRAM IS WITH YOU. SHOULD THE PROGRAM PROVE DEFECTIVE, YOU ASSUME THE COST OF ALL NECESSARY SERVICING, REPAIR OR CORRECTION.

IN NO EVENT UNLESS REQUIRED BY APPLICABLE LAW OR AGREED TO IN WRITING WILL ANY COPYRIGHT HOLDER, OR ANY OTHER PARTY WHO MAY MODIFY AND/OR REDISTRIBUTE THE PROGRAM, BE LIABLE TO YOU FOR DAMAGES, INCLUDING ANY GENERAL, SPECIAL, INCIDENTAL OR CONSEQUENTIAL DAMAGES ARISING OUT OF THE USE OR INABILITY TO USE THE PROGRAM (INCLUDING BUT NOT LIMITED TO LOSS OF DATA OR DATA BEING RENDERED INACCURATE OR LOSSES SUSTAINED BY YOU OR THIRD PARTIES OR A FAILURE OF THE PROGRAM TO OPERATE WITH ANY OTHER PROGRAMS), EVEN IF SUCH HOLDER OR OTHER PARTY HAS BEEN ADVISED OF THE POSSIBILITY OF SUCH DAMAGES.

© 2002-2005 D. Khmelev