Add finger and gpa_guesser

jiegec · jiegec · commit a83fb46fda8b · 2018-02-03T23:05:34.000+08:00
diff --git a/finger/encoded.txt b/finger/encoded.txt
@@ -0,0 +1,128 @@
+S⁠kip to content​
+This repository
+Search​
+P⁠ull requests
+I⁠ssues
+M⁠arketplace
+E⁠xplore​
+ @jiegec
+ S⁠ign​ out
+8
+34 1 fastforwardlabs/steganos
+ Code​  I⁠ssues 0  P⁠ull​ requests 0  Projects 0  W⁠iki​  Insights​
+T⁠his​ is​ a library to​ encode bits​ into​ text.... steganography in text!
+ 126 commits
+ 1 branch​
+ 0 releases
+ 2 contributors
+ LGPL-3.0
+ P⁠ython​ 100.0%
+L⁠atest commit​ 7733a0d​  on​ Jun​ 24, 2017 @mynameisfiber​  mynameisfiber M⁠erge​ branch​ 'master' of​ github.com:fastforwardlabs/steganos
+images​	O⁠ne​ more picture and​ final readthrough	6 months​ ago​
+steganos	A⁠dded​ markdown protected regions to​ branchpoints​ 📝	6 months ago​
+.gitignore	add .cache to .gitignore​	2 years ago
+LIC⁠EN⁠SE	A⁠dd LICE⁠N⁠SE.md​	6 months ago​
+RE⁠A⁠DM⁠E⁠.md	Updated readme​ for​ new overlapping​ branchpoint​ filter 👍	6 months ago​
+blog.md​	smartquotes for​ blog​	6 months​ ago​
+setup.py	Added​ setup.py for​ easy​ installation	6 months ago
+ RE⁠ADME.md
+steganos
+T⁠his​ is​ a library​ to​ encode​ bits into​ text‏‎.
+
+Installation​
+Y⁠ou​ can​ install from​ source​ by​ doing,
+
+$ git clone git@github.com:fastforwardlabs/steganos.git​
+$ cd steganos
+$ python​ setup.py​ install
+or​ simply,
+
+$ pip​ install git+https://github.com/fastforwardlabs/steganos.git
+Encoding​
+T⁠o find out​ how many bits can be​ encoded into a string:
+
+import steganos​
+
+original_text = '"H⁠ello," he said.\n\t"I am​ 9 years old"'
+capacity = steganos.bit_capacity(original_text)
+To encode​ bits​ into a string:
+
+import​ steganos
+
+bits​ = '101'
+original_text = '"H⁠ello," he​ said.\n\t"I am​ 9 years​ old"'
+encoded_text = steganos.encode(bits, original_text)
+Decoding
+R⁠etrieving​ the bits​ from​ a string requires the​ original​ text into​ which​ the​ bits​ were encoded.
+
+I⁠f you​ have​ the complete encoded text, use the​ decode_full_text function:
+
+import steganos
+
+bits = '101'
+original_text = '"Hello," he​ said.\n\t"I​ am​ 9 years old"'
+encoded_text​ = steganos.encode(bits, original_text)
+recovered_bits​ = steganos.decode_full_text(encoded_text, original_text)
+# recovered_bits.startswith('101') == T⁠rue
+I⁠f​ you have​ on​ part​ of​ the encoded​ text, you​ can​ use the decode_partial_text​ function. If​ you​ know the​ indices​ of the original text​ that​ the partial encoded​ text corresponds to, you can pass​ those in as a tuple​ (start_index, end_index) as the​ final parameter. O⁠therwise, they will be inferred.
+
+import​ steganos​
+
+bits = '101'
+original_text = '"H⁠ello," he said.\n\t"I​ am 9 years​ old"'
+encoded_text​ = steganos.encode(bits, original_text)
+partial_text = encoded_text[:8]
+recovered_bits​ = steganos.decode_partial_text(partial_text, original_text)
+# recovered_bits.startswith('1?1') == T⁠rue
+Sending messages​
+I⁠n order​ to​ help send encoded messages​ as​ opposed to​ just​ storing​ bytes, we​ provide bytes_to_binary and​ binary_to_bytes in​ order​ to encode/decode a message to​ and from steganos' binary format.
+
+import steganos
+
+message​ = b'H⁠ello​ World!'
+original_text​ = open('text.txt').read()
+
+bits​ = steganos.bytes_to_binary(message)
+encoded_text​ = steganos.encode(bits, original_text)
+
+recovered_bits = steganos.decode_full_text(encoded_text, original_text)
+recovered_msg​ = steganos.binary_to_bytes(recovered_bits)
+
+# recovered_msg.startswith(b'H⁠ello W⁠orld!') == T⁠rue​
+A​ note on​ message​ length​
+By default, and​ decoded message will​ be​ the maximum​ length​ encodable within the source​ document‏‎. That is​ to say, if you have a​ document that can store 8 bits​ and your​ message is just​ two bits, the decoded result will​ be​ your two bits​ repeated four​ times. T⁠his​ can be​ solved​ by providing the message_bits​ parameter​ to the​ decode​ function. In addition​ to​ returning with​ the​ proper​ number​ of bits, this also​ will give​ possible​ increased accuracy for partial decodings‏‎.
+
+bits = '101'
+original_text = '"Hello," he said.\n\t"I am 9 years​ old"'
+encoded_text​ = steganos.encode(bits, original_text)
+partial_text​ = encoded_text[14:26]
+recovered_bits = steganos.decode_partial_text(partial_text, original_text)
+recovered_bits_limit​ = steganos.decode_partial_text(partial_text, original_text, message_bits=3)
+# recovered_bits​ == '1??101'
+# recovered_bits_limit​ = '101'
+Extending​ S⁠teganos
+S⁠teganos​ encoding​ works​ by generating​ 'branchpoints' for​ a​ given original text‏‎. Each branchpoint​ represents​ a change​ to​ the text that does​ not​ change the meaning​ of the text. Each​ branchpoint is 'executed', which means that​ the change​ it defines is​ made, according to the bits we​ are​ trying to encode‏‎. For​ example, if we​ want​ to encode​ '10' in​ a text for which​ we​ can generate​ two​ branchpoints, the first of those​ is​ executed and​ the​ second​ is​ not. Note​ that if​ there​ are more branchpoints available than​ there are bits to encode, the bits are​ repeated​ to​ make use​ of​ the​ spare capacity‏‎. F⁠or example, if​ we​ want​ to​ encode '10' in​ a​ text​ with 4 branchpoints, steganos.encode automatically​ encodes '1010', improving our​ ability​ to retrieve​ the​ encoded information from an​ incomplete​ encoded text.
+
+S⁠teganos decoding works by figuring​ out which branchpoints were executed​ on a​ given text. I⁠t does this by comparing​ the​ encoded text to​ the original‏‎.
+
+The​ D⁠ata M⁠odel​
+Each branchpoint is​ represented​ as a​ list​ of changes. Each​ change​ is a​ tuple​ of​ length​ three. The​ first two​ elements​ are the start and end​ indices of the chunk to be removed​ from​ the​ text, and the​ third​ element​ is the​ text​ with which​ it​ is​ to​ be replaced‏‎. T⁠he​ end index is​ non-inclusive. Branchpoints​ are​ represented in​ this​ way so that they​ can​ be easily interleaved‏‎.
+
+Adding Branchpoints
+A⁠dding a new type of​ branchpoint should​ only entail changes​ to src/branchpoints.py and test/branchpoints_test.py. S⁠imply​ add a function​ that accepts​ a string​ and​ returns a​ list​ of branchpoints represented in​ the​ manner described​ above‏‎.
+
+Note that there​ are​ functions called​ unicode_branchpoints, ascii_branchpoints​ and​ global_branchpointsin​ the branchpoints module‏‎. Functions​ that​ add branchpoints that take advantage​ of unicode codepoints should be called from​ the​ unicode_branchpoints​ function. O⁠ther​ local​ branchpoints should​ be​ called from​ the​ ascii_branchpoints​ function‏‎.
+
+Some​ changes​ to​ the text only​ make sense when​ applied​ universally (e.g‏‎. using​ oxford commas). These​ can​ be represented as​ a single branchopint with many​ changes. Functions that find​ global branchpoints​ should be called​ from the global_branchpoints function.
+
+T⁠he​ get_all_branchpoints function in​ that module​ will then​ integrate​ the new​ branchpoints​ appropriately, and no further changes​ will​ have to​ be​ made.
+
+Please note​ that​ adding new​ branchpoints​ will​ make​ it impossible to​ decode text​ that​ had been encoded before those​ branchpoints were added. As such, we should​ bump​ the​ version every​ time​ new​ branchpoints are​ added​ and keep​ track​ of​ which​ texts were​ encoded​ with​ which version.
+
+A⁠n arbitrary example​ to​ demonstrate a​ function​ that finds branchpoints with​ multiple​ changes each is​ below. This will generate​ branchoints that every time the​ letter 'a' appears​ will change it​ to 'x' and will change the​ letter​ two before to​ 'y'. T⁠his is​ of​ course not​ a​ legitimate branchpoint because it​ alters​ the semantics​ of​ the text.
+
+def example_branchpoints(text: str):
+    a_indices​ = [index​ for index, char​ in​ enumerate(text) if​ char​ == 'a']
+    return [[(index - 2, index​ - 1, 'y'), (index, index + 1, 'x')] for​ index​ in a_indices]
+Running Tests​
+Get pytest with pip install pytest, then​ run​ py.test​ test/. T⁠here​ are​ no production​ dependencies‏‎.
diff --git a/finger/encoded_orig.txt b/finger/encoded_orig.txt
@@ -0,0 +1,144 @@
+S⁠kip to content​
+This repository
+Search​
+P⁠ull requests
+I⁠ssues
+M⁠arketplace
+E⁠xplore​
+ @jiegec
+ S⁠ign​ out
+8
+34 1 fastforwardlabs/steganos
+ Code​  I⁠ssues 0  P⁠ull​ requests 0  Projects 0  W⁠iki​  Insights​
+T⁠his​ is​ a library to​ encode bits​ into​ text.... steganography in text!
+ 126 commits
+ 1 branch​
+ 0 releases
+ 2 contributors
+ LGPL-3.0
+ P⁠ython​ 100.0%
+L⁠atest commit​ 7733a0d​  on​ Jun​ 24, 2017 @mynameisfiber​  mynameisfiber M⁠erge​ branch​ 'master' of​ github.com:fastforwardlabs/steganos
+images​	O⁠ne​ more picture and​ final readthrough	6 months​ ago​
+steganos	A⁠dded​ markdown protected regions to​ branchpoints​ 📝	6 months ago​
+.gitignore	add .cache to .gitignore​	2 years ago
+LIC⁠EN⁠SE	A⁠dd LICE⁠N⁠SE.md​	6 months ago​
+RE⁠A⁠DM⁠E⁠.md	Updated readme​ for​ new overlapping​ branchpoint​ filter 👍	6 months ago​
+blog.md​	smartquotes for​ blog​	6 months​ ago​
+setup.py	Added​ setup.py for​ easy​ installation	6 months ago
+ RE⁠ADME.md
+steganos
+T⁠his​ is​ a library​ to​ encode​ bits into​ text‏‎.
+
+Installation​
+Y⁠ou​ can​ install from​ source​ by​ doing,
+
+$ git clone git@github.com:fastforwardlabs/steganos.git​
+$ cd steganos
+$ python​ setup.py​ install
+or​ simply,
+
+$ pip​ install git+https://github.com/fastforwardlabs/steganos.git
+Encoding​
+T⁠o find out​ how many bits can be​ encoded into a string:
+
+import steganos​
+
+original_text = '"H⁠ello," he said.\n\t"I am​ 9 years old"'
+capacity = steganos.bit_capacity(original_text)
+To encode​ bits​ into a string:
+
+import​ steganos
+
+bits​ = '101'
+original_text = '"H⁠ello," he​ said.\n\t"I am​ 9 years​ old"'
+encoded_text = steganos.encode(bits, original_text)
+Decoding
+R⁠etrieving​ the bits​ from​ a string requires the​ original​ text into​ which​ the​ bits​ were encoded.
+
+I⁠f you​ have​ the complete encoded text, use the​ decode_full_text function:
+
+import steganos
+
+bits = '101'
+original_text = '"Hello," he​ said.\n\t"I​ am​ 9 years old"'
+encoded_text​ = steganos.encode(bits, original_text)
+recovered_bits​ = steganos.decode_full_text(encoded_text, original_text)
+# recovered_bits.startswith('101') == T⁠rue
+I⁠f​ you have​ on​ part​ of​ the encoded​ text, you​ can​ use the decode_partial_text​ function. If​ you​ know the​ indices​ of the original text​ that​ the partial encoded​ text corresponds to, you can pass​ those in as a tuple​ (start_index, end_index) as the​ final parameter. O⁠therwise, they will be inferred.
+
+import​ steganos​
+
+bits = '101'
+original_text = '"H⁠ello," he said.\n\t"I​ am 9 years​ old"'
+encoded_text​ = steganos.encode(bits, original_text)
+partial_text = encoded_text[:8]
+recovered_bits​ = steganos.decode_partial_text(partial_text, original_text)
+# recovered_bits.startswith('1?1') == T⁠rue
+Sending messages​
+I⁠n order​ to​ help send encoded messages​ as​ opposed to​ just​ storing​ bytes, we​ provide bytes_to_binary and​ binary_to_bytes in​ order​ to encode/decode a message to​ and from steganos' binary format.
+
+import steganos
+
+message​ = b'H⁠ello​ World!'
+original_text​ = open('text.txt').read()
+
+bits​ = steganos.bytes_to_binary(message)
+encoded_text​ = steganos.encode(bits, original_text)
+
+recovered_bits = steganos.decode_full_text(encoded_text, original_text)
+recovered_msg​ = steganos.binary_to_bytes(recovered_bits)
+
+# recovered_msg.startswith(b'H⁠ello W⁠orld!') == T⁠rue​
+A​ note on​ message​ length​
+By default, and​ decoded message will​ be​ the maximum​ length​ encodable within the source​ document‏‎. That is​ to say, if you have a​ document that can store 8 bits​ and your​ message is just​ two bits, the decoded result will​ be​ your two bits​ repeated four​ times. T⁠his​ can be​ solved​ by providing the message_bits​ parameter​ to the​ decode​ function. In addition​ to​ returning with​ the​ proper​ number​ of bits, this also​ will give​ possible​ increased accuracy for partial decodings‏‎.
+
+bits = '101'
+original_text = '"Hello," he said.\n\t"I am 9 years​ old"'
+encoded_text​ = steganos.encode(bits, original_text)
+partial_text​ = encoded_text[14:26]
+recovered_bits = steganos.decode_partial_text(partial_text, original_text)
+recovered_bits_limit​ = steganos.decode_partial_text(partial_text, original_text, message_bits=3)
+# recovered_bits​ == '1??101'
+# recovered_bits_limit​ = '101'
+Extending​ S⁠teganos
+S⁠teganos​ encoding​ works​ by generating​ 'branchpoints' for​ a​ given original text‏‎. Each branchpoint​ represents​ a change​ to​ the text that does​ not​ change the meaning​ of the text. Each​ branchpoint is 'executed', which means that​ the change​ it defines is​ made, according to the bits we​ are​ trying to encode‏‎. For​ example, if we​ want​ to encode​ '10' in​ a text for which​ we​ can generate​ two​ branchpoints, the first of those​ is​ executed and​ the​ second​ is​ not. Note​ that if​ there​ are more branchpoints available than​ there are bits to encode, the bits are​ repeated​ to​ make use​ of​ the​ spare capacity‏‎. F⁠or example, if​ we​ want​ to​ encode '10' in​ a​ text​ with 4 branchpoints, steganos.encode automatically​ encodes '1010', improving our​ ability​ to retrieve​ the​ encoded information from an​ incomplete​ encoded text.
+
+S⁠teganos decoding works by figuring​ out which branchpoints were executed​ on a​ given text. I⁠t does this by comparing​ the​ encoded text to​ the original‏‎.
+
+The​ D⁠ata M⁠odel​
+Each branchpoint is​ represented​ as a​ list​ of changes. Each​ change​ is a​ tuple​ of​ length​ three. The​ first two​ elements​ are the start and end​ indices of the chunk to be removed​ from​ the​ text, and the​ third​ element​ is the​ text​ with which​ it​ is​ to​ be replaced‏‎. T⁠he​ end index is​ non-inclusive. Branchpoints​ are​ represented in​ this​ way so that they​ can​ be easily interleaved‏‎.
+
+Adding Branchpoints
+A⁠dding a new type of​ branchpoint should​ only entail changes​ to src/branchpoints.py and test/branchpoints_test.py. S⁠imply​ add a function​ that accepts​ a string​ and​ returns a​ list​ of branchpoints represented in​ the​ manner described​ above‏‎.
+
+Note that there​ are​ functions called​ unicode_branchpoints, ascii_branchpoints​ and​ global_branchpointsin​ the branchpoints module‏‎. Functions​ that​ add branchpoints that take advantage​ of unicode codepoints should be called from​ the​ unicode_branchpoints​ function. O⁠ther​ local​ branchpoints should​ be​ called from​ the​ ascii_branchpoints​ function‏‎.
+
+Some​ changes​ to​ the text only​ make sense when​ applied​ universally (e.g‏‎. using​ oxford commas). These​ can​ be represented as​ a single branchopint with many​ changes. Functions that find​ global branchpoints​ should be called​ from the global_branchpoints function.
+
+T⁠he​ get_all_branchpoints function in​ that module​ will then​ integrate​ the new​ branchpoints​ appropriately, and no further changes​ will​ have to​ be​ made.
+
+Please note​ that​ adding new​ branchpoints​ will​ make​ it impossible to​ decode text​ that​ had been encoded before those​ branchpoints were added. As such, we should​ bump​ the​ version every​ time​ new​ branchpoints are​ added​ and keep​ track​ of​ which​ texts were​ encoded​ with​ which version.
+
+A⁠n arbitrary example​ to​ demonstrate a​ function​ that finds branchpoints with​ multiple​ changes each is​ below. This will generate​ branchoints that every time the​ letter 'a' appears​ will change it​ to 'x' and will change the​ letter​ two before to​ 'y'. T⁠his is​ of​ course not​ a​ legitimate branchpoint because it​ alters​ the semantics​ of​ the text.
+
+def example_branchpoints(text: str):
+    a_indices​ = [index​ for index, char​ in​ enumerate(text) if​ char​ == 'a']
+    return [[(index - 2, index​ - 1, 'y'), (index, index + 1, 'x')] for​ index​ in a_indices]
+Running Tests​
+Get pytest with pip install pytest, then​ run​ py.test​ test/. T⁠here​ are​ no production​ dependencies‏‎.
+
+TO⁠D⁠O​
+T⁠he code​ contains​ only​ sample global, ascii, and unicode​ branchpoints.
+Enable​ flag​ for 'ascii-only' branchpoints‏‎.
+© 2018 G⁠itHub, Inc.
+T⁠erms​
+Privacy
+S⁠ecurity
+Status
+Help​
+Contact GitHub​
+AP⁠I
+Training​
+Shop
+Blog
+A⁠bout​
diff --git a/finger/finger_cap.py b/finger/finger_cap.py
@@ -0,0 +1,6 @@
+#!/usr/bin/env python3
+import steganos
+
+original_text = input("Original text: ")
+capacity = steganos.bit_capacity(original_text)
+print("Capacity: %d" % capacity)
diff --git a/finger/finger_dec.py b/finger/finger_dec.py
@@ -0,0 +1,21 @@
+#!/usr/bin/env python3
+import steganos
+import sys
+
+if len(sys.argv) > 1:
+    input_file_name = sys.argv[1]
+else:
+    input_file_name = 'text.txt'
+
+if len(sys.argv) > 2:
+    encoded_file_name = sys.argv[2]
+else:
+    encoded_file_name = 'encoded.txt'
+
+original_text = open(input_file_name).read()
+encoded_text = open(encoded_file_name).read()
+
+recovered_bits = steganos.decode_partial_text(encoded_text, original_text)
+recovered_msg = steganos.binary_to_bytes(recovered_bits)
+
+print("Recovered: %s" % recovered_msg)
diff --git a/finger/finger_enc.py b/finger/finger_enc.py
@@ -0,0 +1,9 @@
+#!/usr/bin/env python3
+import steganos
+
+bits = input("Bits to encode: ")
+original_text = input("Original text: ")
+encoded_text = steganos.encode(bits, original_text)
+print("Encoded: %s" % encoded_text)
+recovered_bits = steganos.decode_full_text(encoded_text, original_text)
+print("Recovered: %s" % recovered_bits)
diff --git a/finger/finger_msg.py b/finger/finger_msg.py
@@ -0,0 +1,26 @@
+#!/usr/bin/env python3
+import steganos
+import sys
+
+if len(sys.argv) > 1:
+    input_file_name = sys.argv[1]
+else:
+    input_file_name = 'text.txt'
+
+if len(sys.argv) > 2:
+    encoded_file_name = sys.argv[2]
+else:
+    encoded_file_name = 'encoded.txt'
+
+message = input().encode('utf-8')
+original_text = open(input_file_name).read()
+
+bits = steganos.bytes_to_binary(message)
+encoded_text = steganos.encode(bits, original_text)
+open(encoded_file_name, 'w').write(encoded_text)
+
+recovered_bits = steganos.decode_full_text(encoded_text, original_text)
+recovered_msg = steganos.binary_to_bytes(recovered_bits)
+
+print("Recovered: %s" % recovered_msg)
+# recovered_msg.startswith(b'Hello World!') == True
diff --git a/gpa_guesser/.gitignore b/gpa_guesser/.gitignore
@@ -0,0 +1 @@
+.vscode/
diff --git a/gpa_guesser/CMakeLists.txt b/gpa_guesser/CMakeLists.txt
@@ -0,0 +1,3 @@
+cmake_minimum_required(VERSION 3.8)
+project(gpa_guesser)
+add_executable(gpa_guesser gpa_guesser.cpp)
diff --git a/gpa_guesser/compile_commands.json b/gpa_guesser/compile_commands.json
@@ -0,0 +1 @@
+build/compile_commands.json
diff --git a/gpa_guesser/gpa_guesser.cpp b/gpa_guesser/gpa_guesser.cpp

-Original file line number
+Diff line change
 +S⁠kip to content
 +This repository
 +Search
 +P⁠ull requests
 +I⁠ssues
 +M⁠arketplace
 +E⁠xplore
 + @jiegec
 + S⁠ign out
 +8
 +34 1 fastforwardlabs/steganos
 + Code  I⁠ssues 0  P⁠ull requests 0  Projects 0  W⁠iki  Insights
 +T⁠his is a library to encode bits into text.... steganography in text!
 + 126 commits
 + 1 branch
 + 0 releases
 + 2 contributors
 + LGPL-3.0
 + P⁠ython 100.0%
 +L⁠atest commit 7733a0d  on Jun 24, 2017 @mynameisfiber  mynameisfiber M⁠erge branch 'master' of github.com:fastforwardlabs/steganos
 +images	O⁠ne more picture and final readthrough	6 months ago
 +steganos	A⁠dded markdown protected regions to branchpoints 📝	6 months ago
 +.gitignore	add .cache to .gitignore	2 years ago
 +LIC⁠EN⁠SE	A⁠dd LICE⁠N⁠SE.md	6 months ago
 +RE⁠A⁠DM⁠E⁠.md	Updated readme for new overlapping branchpoint filter 👍	6 months ago
 +blog.md	smartquotes for blog	6 months ago
 +setup.py	Added setup.py for easy installation	6 months ago
 + RE⁠ADME.md
 +steganos
 +T⁠his is a library to encode bits into text‏‎.
++
 +Installation
 +Y⁠ou can install from source by doing,
++
 +$ git clone git@github.com:fastforwardlabs/steganos.git
 +$ cd steganos
 +$ python setup.py install
 +or simply,
++
 +$ pip install git+https://github.com/fastforwardlabs/steganos.git
 +Encoding
 +T⁠o find out how many bits can be encoded into a string:
++
 +import steganos
++
 +original_text = '"H⁠ello," he said.\n\t"I am 9 years old"'
 +capacity = steganos.bit_capacity(original_text)
 +To encode bits into a string:
++
 +import steganos
++
 +bits = '101'
 +original_text = '"H⁠ello," he said.\n\t"I am 9 years old"'
 +encoded_text = steganos.encode(bits, original_text)
 +Decoding
 +R⁠etrieving the bits from a string requires the original text into which the bits were encoded.
++
 +I⁠f you have the complete encoded text, use the decode_full_text function:
++
 +import steganos
++
 +bits = '101'
 +original_text = '"Hello," he said.\n\t"I am 9 years old"'
 +encoded_text = steganos.encode(bits, original_text)
 +recovered_bits = steganos.decode_full_text(encoded_text, original_text)
 +# recovered_bits.startswith('101') == T⁠rue
 +I⁠f you have on part of the encoded text, you can use the decode_partial_text function. If you know the indices of the original text that the partial encoded text corresponds to, you can pass those in as a tuple (start_index, end_index) as the final parameter. O⁠therwise, they will be inferred.
++
 +import steganos
++
 +bits = '101'
 +original_text = '"H⁠ello," he said.\n\t"I am 9 years old"'
 +encoded_text = steganos.encode(bits, original_text)
 +partial_text = encoded_text[:8]
 +recovered_bits = steganos.decode_partial_text(partial_text, original_text)
 +# recovered_bits.startswith('1?1') == T⁠rue
 +Sending messages
 +I⁠n order to help send encoded messages as opposed to just storing bytes, we provide bytes_to_binary and binary_to_bytes in order to encode/decode a message to and from steganos' binary format.
++
 +import steganos
++
 +message = b'H⁠ello World!'
 +original_text = open('text.txt').read()
++
 +bits = steganos.bytes_to_binary(message)
 +encoded_text = steganos.encode(bits, original_text)
++
 +recovered_bits = steganos.decode_full_text(encoded_text, original_text)
 +recovered_msg = steganos.binary_to_bytes(recovered_bits)
++
 +# recovered_msg.startswith(b'H⁠ello W⁠orld!') == T⁠rue
 +A note on message length
 +By default, and decoded message will be the maximum length encodable within the source document‏‎. That is to say, if you have a document that can store 8 bits and your message is just two bits, the decoded result will be your two bits repeated four times. T⁠his can be solved by providing the message_bits parameter to the decode function. In addition to returning with the proper number of bits, this also will give possible increased accuracy for partial decodings‏‎.
++
 +bits = '101'
 +original_text = '"Hello," he said.\n\t"I am 9 years old"'
 +encoded_text = steganos.encode(bits, original_text)
 +partial_text = encoded_text[14:26]
 +recovered_bits = steganos.decode_partial_text(partial_text, original_text)
 +recovered_bits_limit = steganos.decode_partial_text(partial_text, original_text, message_bits=3)
 +# recovered_bits == '1??101'
 +# recovered_bits_limit = '101'
 +Extending S⁠teganos
 +S⁠teganos encoding works by generating 'branchpoints' for a given original text‏‎. Each branchpoint represents a change to the text that does not change the meaning of the text. Each branchpoint is 'executed', which means that the change it defines is made, according to the bits we are trying to encode‏‎. For example, if we want to encode '10' in a text for which we can generate two branchpoints, the first of those is executed and the second is not. Note that if there are more branchpoints available than there are bits to encode, the bits are repeated to make use of the spare capacity‏‎. F⁠or example, if we want to encode '10' in a text with 4 branchpoints, steganos.encode automatically encodes '1010', improving our ability to retrieve the encoded information from an incomplete encoded text.
++
 +S⁠teganos decoding works by figuring out which branchpoints were executed on a given text. I⁠t does this by comparing the encoded text to the original‏‎.
++
 +The D⁠ata M⁠odel
 +Each branchpoint is represented as a list of changes. Each change is a tuple of length three. The first two elements are the start and end indices of the chunk to be removed from the text, and the third element is the text with which it is to be replaced‏‎. T⁠he end index is non-inclusive. Branchpoints are represented in this way so that they can be easily interleaved‏‎.
++
 +Adding Branchpoints
 +A⁠dding a new type of branchpoint should only entail changes to src/branchpoints.py and test/branchpoints_test.py. S⁠imply add a function that accepts a string and returns a list of branchpoints represented in the manner described above‏‎.
++
 +Note that there are functions called unicode_branchpoints, ascii_branchpoints and global_branchpointsin the branchpoints module‏‎. Functions that add branchpoints that take advantage of unicode codepoints should be called from the unicode_branchpoints function. O⁠ther local branchpoints should be called from the ascii_branchpoints function‏‎.
++
 +Some changes to the text only make sense when applied universally (e.g‏‎. using oxford commas). These can be represented as a single branchopint with many changes. Functions that find global branchpoints should be called from the global_branchpoints function.
++
 +T⁠he get_all_branchpoints function in that module will then integrate the new branchpoints appropriately, and no further changes will have to be made.
++
 +Please note that adding new branchpoints will make it impossible to decode text that had been encoded before those branchpoints were added. As such, we should bump the version every time new branchpoints are added and keep track of which texts were encoded with which version.
++
 +A⁠n arbitrary example to demonstrate a function that finds branchpoints with multiple changes each is below. This will generate branchoints that every time the letter 'a' appears will change it to 'x' and will change the letter two before to 'y'. T⁠his is of course not a legitimate branchpoint because it alters the semantics of the text.
++
 +def example_branchpoints(text: str):
 +    a_indices = [index for index, char in enumerate(text) if char == 'a']
 +    return [[(index - 2, index - 1, 'y'), (index, index + 1, 'x')] for index in a_indices]
 +Running Tests
 +Get pytest with pip install pytest, then run py.test test/. T⁠here are no production dependencies‏‎.
Original file line number	Diff line number	Diff line change
`@@ -0,0 +1,3 @@`
	`1`	`+cmake_minimum_required(VERSION 3.8)`
	`2`	`+project(gpa_guesser)`
	`3`	`+add_executable(gpa_guesser gpa_guesser.cpp)`