UberStudent Forums

Help and Community for Users of Uberstudent
Return to UberStudent Home Page
It is currently Wed Nov 21, 2018 10:09 am

All times are UTC




UberStudent Logo Shirt
Hey! You can wear this UberCool UberStudent Logo Shirt

Post new topic Reply to topic  [ 12 posts ]  Go to page 1, 2  Next
Author Message
PostPosted: Wed Aug 22, 2018 6:51 am 
Offline
Established UberStudent User

Joined: Sun Aug 19, 2018 2:09 am
Posts: 10
I have been thinking of my own long term project, where UberStudent as is would be used as base of data additions and some specialised software components.

There will be no attempt to re-badge the OS, but it may be given a some superficial changes such as a distinct background that highlights its specialist nature and the package name, rather than the OS would be used to call attention to its content. I would bee happy to adopt some standard format of reference such as whatever "hosted on UberStudent OS".

I dare say I might not the only one who may want to use UberStudent this way, as a stable platform for specific research using a common suite of applications and an undemanding OS. The package would contain only be available to members of the research group, as copyrighted materials would otherwise be at issue.

This is not an enquiry about legalities of using UberStudent, but rather developing a policy that catered for such uses and at the same time fed back into your OS development; a symbiotic arrangement that may encourage similar ventures to my own. The plus is that UberStudent would gain adherents amongst a group of people that may never have otherwise looked in such direction; so long as it could be co-promoted with the research project itself.

I am under no tight schedule with this project, so I suggest if a policy is not already in place, that my project might be used as an example in forming one. I undertake not to act on anything said until an official statement is given.


Top
 Profile  
 

Click Here to Get Inexpensive UberStudent Discs


Click here to get UberStudent discs delivered to your mailbox
Delivered Directly to Your Mailbox!

PostPosted: Wed Aug 22, 2018 1:32 pm 
Offline
UberStudent Founder and Lead Developer
User avatar

Joined: Sun Nov 11, 2012 8:54 am
Posts: 958
Location: UberStudent Headquarters
What's your overall goal?

_________________
UberStudent Must Become Self-Sustaining. If UberStudent and my dedicated support of it has benefited you, it's important to make a donation. Thanks!

The UberStudent Headquarters:
    * Mobo: MSI 870-G45
    * Processor: AMD Phenom II X4 3.4GHz
    * RAM: G.SKILL 8GB DDR3 1600
    * Graphics: MSI NVIDIA GeForce GTX 750 Ti
    * Drive: OCZ Vertex SSD
    * Case: Rosewill ATX Mid Tower
    * Monitors: 2 x Dell UltraSharp
    * With UberStudent 4.3 Development Build
You should build your own computers, too!


Top
 Profile  
 
PostPosted: Wed Aug 22, 2018 2:40 pm 
Offline
Established UberStudent User

Joined: Sun Aug 19, 2018 2:09 am
Posts: 10
Stephan, thanks.

It is to load a package that includes an OS and Applications with a library of central works, and the means to scan in searchable PDFs and convert this into a ArangoDB of paragraphs of a each work that are assembled into different editions, and with various referencing systems. It serves the purpose of looking up references through various means and finding their context.

The other aspect is having a stocked library of shared in Zotero, then some crafted Lyx documents. There are few things I would like to add to the OS including short unique ids, and some other web forms etc.,. The idea is that it is a scholarly ebook production house, a research facility, a communications hub and a forum for as a 'republic of letters' format and as a common publishing resource. I do not expect to even start it until; sometime next year.

UberStudent presents a really strong foundation for the sort of enclosed single purpose OS I was looking to create, I had planned slapping something together myself, but UberStudent far outstrips anything I could do.

This is completely separate from the tutoring which I have followed your advice and gone towards Edubuntu.


Top
 Profile  
 
PostPosted: Thu Aug 23, 2018 4:08 am 
Offline
Established UberStudent User

Joined: Sun Aug 19, 2018 2:09 am
Posts: 10
I thought I might add how I expect this thing to work

The vital part is using arangoDB, after a long experiment with all the things that have not worked well for studying bodies of literature, my conclusion is that well informed research has to become a unified system of reference and review.

Using A json parser DB each work on request can generate a different file format, but also the particular order and ornamentation of the ebook including all its editions. Being able to study one work while being in possession of the works it refers to is the ultimate aim, having a data-base of paragraphs is the quick way of finding something, the other is to produce a work that is marked in the way most useful to the particular study.

If I can get that right all the rest follows as an extension of that working logic, so in the end I really don't know exactly how that would change the OS, but I want to begin on an OS that has no fat and is designed for education first. Hence I was going to simply strip down one, which seems not only pointless now but also counterproductive.

The idea is that education, research and converting a body of literature within a group already interested in the area brings coherence that would be otherwise lost. The first is to get the literature conversion tools produced (through html applications, JavaScript, node.js etc.), hence the need to generate short world exclusive IDs so that the id of a paragraph associates it within a work. This means any work can be reduced to a sequence of ids. The format another sequence of ids within a markup. An edition being a different sequence and so on.

If that is nailed then its is a matter of following through, I cannot guess at what the end product will look like or contain, I have a fairly good idea of the functionality that would be needed. I have over the years experimented with nearly every sort of text production, including TEI and in the end realised that small data sets, where items are kept in plain text is the best way forward. we cannot know what future needs may be, so at a basic level we need to preserve content within a rich and accumulating environment that could be easily transformed into other formats.

Researcher X is using Darwin's first edition, produces the primary DB on it, with page numbers etc. Researcher Y wants to use the fifth edition, each paragraph of raw text is compared and listed in its place, each different paragraph is added in, along with the page number references. This needs workflow controls, etc., logs and status updates. Likewise, when different language versions are included.

However, the works need to be generated in a process of study, and the results need to be combined as a course resource for those following. It is a fairly ambitious project. The idea that by being unbound to paper, so much more of what researchers do becomes available in a way that makes more researchers educate themselves, that researching and teaching form a continuum and not just a publishing cycle.

Sorry for this rant, but hedging around it seem unfair.


Top
 Profile  
 
 Post subject: Re: eBook Database
PostPosted: Fri Aug 24, 2018 10:21 am 
Offline
UberStudent Staff
User avatar

Joined: Mon Jan 21, 2013 7:18 pm
Posts: 126
Gregory E Schofield wrote:
The idea is that it is a scholarly ebook production house, a research facility, a communications hub and a forum for as a 'republic of letters' format and as a common publishing resource.

Will you be using Foxxy to write this ebook database software? Limiting your ebook database to being accessible from only one operating system is antithetical to the notion of a respublica literaria; I predict that such limitation would doom your ebook database to being unused by those who would have been most interested in using it.

_________________
Cybermancy.net - Creativity via code


Top
 Profile  
 
 Post subject: Re: eBook Database
PostPosted: Fri Aug 24, 2018 1:11 pm 
Offline
Established UberStudent User

Joined: Sun Aug 19, 2018 2:09 am
Posts: 10
Damion Lunin wrote:
Will you be using Foxxy to write this ebook database software? Limiting your ebook database to being accessible from only one operating system is antithetical to the notion of a respublica literaria; I predict that such limitation would doom your ebook database to being unused by those who would have been most interested in using it.


Absolutely no intention of restricting the publications to one platform. The idea would be to host publications which could be used over the net, or download single works or matrices (a one or more target works and all their references). The use of an OS is for a research and creation environment, set up and ready to go. However, getting there begins with solving the the publishing problem, which I want to extend into an email system, where 'letters' can be gathered and printed out for reading or achieved as collections (a correspondence).

My problem is I have seen problems with the common solutions and have been waiting too long for a solution. Some of this includes storage, another packaging, and lastly controlling the form of what is produced for reading. I suspect a version of epub packaging could work. That is the 'book' is single package, a single data base, perhaps producing no more an a HTML version with some features switched on and others off, but within the OS what can be done with the same data-publication is much wider.

The other thing as this is a scholarly venture, as publishing systems go, even if I get it right, it will be a slow process before it gains much ground. The truth is I have lost patience and would prefer to get some proof of concept running, but this would be in an OS environment, where I can botch together 'solutions' and then look at how they might be coded better and packaged usefully. If any others want to be involved at the early stage, I simply copy the OS.

It is not just a technical problem, but one of design where identities play a central role and must be flexible without becoming incoherent.

I figure to begin with a system that makes it easy to dissemble a searchable PDF, logs the stage of processing and keeps the 'workshop' environment coherent, and then allows it to be resembled in a variety of forms and formats is Stage I.

So I only know Foxxy by name and description, ArangoDB seems to tick all the right boxes, but now it is really just the determination to get it going, as for a long time I had pinned my hopes on Epub3, which did not prove the breakthrough it had promised. Damion thanks for your input, I know to most people this project might sound either a little daft or completely unnecessary, but it bringing quality works to the public has been something I have been pursuing since the days of bulletin boards.


Top
 Profile  
 
 Post subject: Re: eBook Database
PostPosted: Sat Aug 25, 2018 9:22 am 
Offline
UberStudent Staff
User avatar

Joined: Mon Jan 21, 2013 7:18 pm
Posts: 126
Gregory E Schofield wrote:
So I only know Foxxy by name and description, ArangoDB seems to tick all the right boxes, but now it is really just the determination to get it going, as for a long time I had pinned my hopes on Epub3, which did not prove the breakthrough it had promised.

It looks like you could utilize the annotation feature of ePub3 to implement your per-paragraph identities, and you could utilize the ePub OCF 3 container format for encapsulating a set of related resources into a single ePub3 container file. I think the real magic will be in the ebook database software that you create, not in the ebook format itself. Foxxy is a tool for creating database software using ArangoDB and the Foxx-Framework.

_________________
Cybermancy.net - Creativity via code


Top
 Profile  
 
 Post subject: Re: eBook Database
PostPosted: Mon Aug 27, 2018 1:34 am 
Offline
Established UberStudent User

Joined: Sun Aug 19, 2018 2:09 am
Posts: 10
Damion Lunin wrote:
It looks like you could utilize the annotation feature of ePub3 to implement your per-paragraph identities, and you could utilize the ePub OCF 3 container format for encapsulating a set of related resources into a single ePub3 container file. I think the real magic will be in the ebook database software that you create, not in the ebook format itself. Foxxy is a tool for creating database software using ArangoDB and the Foxx-Framework.


Damion that extra reference really helped, Foxxy and Foxx-framework look extremely promising and light-weight for what I can make out. Many years ago I reached a dead end with Relational DBs, and surmised a text fragment system was the answer to non-predictable data sources.

Although Json type databases have been developing for some time I only become aware of them recently, but I had a mental list of what they needed to do and having passed through mongoDB, that ticked nearly all the list, to AranagoDB that ticked off all of the list and then added in some. My working assumption was to get the proof of work done as soon as I can manage, and as best as I could botch together as a proof of concept.

Your latest reference made clear what I was my complete misunderstanding of Foxxy and Foxxx-Framework which revived an older idea of container wrapped websites, which a literature container suggests --- I had used as my model for the direction to follow, self-contained, but also externally linked.

I also believe that the the present internet structure has major flaws and will not stand the test of time. Literature is by nature mobile, if a site dies, the literature it provided needs to continue to circulate, especially important for intellectual and historical works, is that they should be self-dependant. Of course that isn't new pdfs do that, except left the the vagaries of file-naming and tagging does not keep them from being lost.

ISBN were created to unambiguously identify a printed work. Epublications need a unique identifier that works in reverse, that it can unambitiously identify a work where-ever it might be found, and that any fragment from a work would carry that identity with it. EPUB3 has internal strictures, but none of the readers can reliably extract fragments of any kind, and none of the systems of extraction allow for none-locational file identities.

In 100 years we will want to get the refereed publication wherever it can be found, in which case a www address will be useless. So while thhis starts with books the greater context plays it role. The point of a container is simply to hold an identity and its content. The point of ArangoDB and other Json type data, is that simple parsing can reconstitute a book even after JS script has only archaeological interest.

Damion, thanks for you help and advise I was systematically misreading what Foxxy was doing until now, which really makes me feel stupid in hindsight.

Greg


Top
 Profile  
 
 Post subject: Re: eBook Database
PostPosted: Mon Aug 27, 2018 9:26 am 
Offline
UberStudent Staff
User avatar

Joined: Mon Jan 21, 2013 7:18 pm
Posts: 126
Gregory E Schofield wrote:
ISBN were created to unambiguously identify a printed work. Epublications need a unique identifier that works in reverse, that it can unambitiously identify a work where-ever it might be found

ISBN corresponds to identity; URL ("Uniform Resource Locator") corresponds to location; as identity and location are separate concepts they can never be merged perfectly. Regarding electronic resources, the general working solution for translating identity to location is a search engine such as Google.com; if you have an ISBN number and need a resource location, you "Google" it. The difficulty of translating identity to location and Google's success with such is what leveraged them into being a multi-billion-dollar company. A centralized database of academic ebooks would be more direct and efficient than a general search engine though. Further, the database's effectiveness as a location provider increases if additionally the database's contents are indexable by external search engines (in other words, the database's contents are visible on publicly browsable webpages).

_________________
Cybermancy.net - Creativity via code


Top
 Profile  
 
 Post subject: Re: eBook Database
PostPosted: Tue Aug 28, 2018 2:00 pm 
Offline
Established UberStudent User

Joined: Sun Aug 19, 2018 2:09 am
Posts: 10
Damion Lunin wrote:
ISBN corresponds to identity; URL ("Uniform Resource Locator") corresponds to location; as identity and location are separate concepts they can never be merged perfectly. Regarding electronic resources, the general working solution for translating identity to location is a search engine such as Google.com; if you have an ISBN number and need a resource location, you "Google" it. The difficulty of translating identity to location and Google's success with such is what leveraged them into being a multi-billion-dollar company. A centralized database of academic ebooks would be more direct and efficient than a general search engine though. Further, the database's effectiveness as a location provider increases if additionally the database's contents are indexable by external search engines (in other words, the database's contents are visible on publicly browsable webpages).


Damion thanks again for your replies.

Instead of ISBN being issuing authority, a short timestamp and registered issuer of 4 base36 letters (A-Z, 0-9) can generate the unique codes backwards millisecond first (83544922182808102-AAAA). Identity can then be read into a localised position;

This millisecond, second, minute, hour, day and month a random sequence that can be kept in directory pyramid ./8/3/5/4/83544922182808102-AAAA a four level descent of 10,000 subdirectories. The local position is determined by the identity expression. One command establishes if the package is present or not in the local pyramid, if no other method is available. It is the fallback position where even scatter packages can be gathered into a sensible reference stack.

Digital storage when tackled as a building block of location, means point to a stack is sufficient for quick blind search. Browsing extracts data to become from the package. Pyramid stacks, can be separated and merged and become a singularity via references; this is the basic reference system, using a footnote reference that includes just the ID, can begin the task with a list of participant stacks. So the mechanical fetching need not rely on anything very sophisticated to work --- this is the foundation.

The preliminary system I will be using on Ubserstudent is to use this information to simply rename links so that browsing can be arranged to suit. However, that is just a jerry-rigged for me and perhaps a few others to work out what is needed to create works of reference, in that process we will have to assault the larger questions. But working from the underside up has the advantage that in being true to getting those details right and clear, partly solves the later problems.

The rule of the short negative, quickness in finding the right package is how quickly not finding the file takes. Hence no big data base, just a registry of repositories, that they themselves can redirect. If Safe-Network takes off it would be simpler still. It does not allow browsing, but it does make referencing a breeze based on a fully marked-up book. Stage 3 the original work most of editions and all of its references have been made, Stage 1 an edition is has been done, Stage 2 edition differences and have been partly catered for (an ongoing task).

The idea is that a footnote to Darwin's 5th edition para 256-259 would be 83544922182808102-AAAA/ed.5={(para.256) -(para.259)} 0r something similar, while a public domain retrieval. This idea is extended put over the top a familiar nomenclature of book titles etc., but as a html, TEI or Zotero snippet that is used as a file selector, like the Zotero store. Which would make browsing works simple over one own collection, but also scalable on a world wide scale --- the direct referencing needs only stack addresses.

I want to extend the identity system to authors so that they might be micro-paid directly for their works while living and become part of the public domain to 80 years their birth if they had died beforehand (the old more sensible copyright. In other words shift the payment system directly to the author's webpurse to keep prices low and authors reimbursed.

Likewise, it should not depend on size, rather so long as formats could be generated (pdfs and epubs), the packages could be referenced within them as a bonus. So starting with an area of interest with a small number of people would begin to infiltrate the system and therefore may recruit others to start using it in their own studies (hence the downloadable OS research and publishing tool set.

Directory pyramids have a lovely feature; say I had one big pyramid, I could separate that on any basis, say into 100 stacks, kept in 100 places. I would then just keep the package information in a catalogue that points to the location, from this is it a fairly trivial matter that would allow browsing , author searches etc.,. What is happening with google search today politically, was anticipated by always thinking of a dispersed system of pointers and registries and a form of stack storage. That is these can be collated, but also fragmented, dispersed and assembled, gives robustness.

The approach is leave general outlines as general answers to general problems, but to begin with the practical task with this in mind. So I start off with the type of literature I am interested in, adapt the OS environment to this task alone, where hopefully it is of interest to others who can contribute more than I can, and it can start living its own life. I want an OS that explains itself and has the tools, the notes and documentation, so people can begin where they are most comfortable, and research and publishing arm develops. I don't want to burden anyone with my plans, but I am quiet happy to share it if anyone has an interest in it.

Despite the grandiose 'ifs and might bes' none of that need be in place aside from the bare minimal, a stack space and the tools to transform texts into data items.

Damion I very much appreciate your comments --- Greg.


Top
 Profile  
 
Display posts from previous:  Sort by  
Post new topic Reply to topic  [ 12 posts ]  Go to page 1, 2  Next

Click Here to Get Inexpensive UberStudent Discs


Click here to get UberStudent discs delivered to your mailbox
Delivered Directly to Your Mailbox!



All times are UTC


Who is online

Users browsing this forum: No registered users and 1 guest


You cannot post new topics in this forum
You cannot reply to topics in this forum
You cannot edit your posts in this forum
You cannot delete your posts in this forum
You cannot post attachments in this forum

Search for:
Jump to:  
cron
Powered by phpBB® Forum Software © phpBB Group