RE: Testing Machine Learning tools for optimizing the Steem experience

in #machinelearning, 7 years ago

If you need machines to run your computation, you can surely use the BOINC and Gridcoin network. I am sure there will be many willing to crunch your WUs. Look at the Anderson Attack project: volunteers were able to finish its workload in a few months (2? iirc). After fulfilling its purpose and publishing a paper, the project ceased to exist.
There is a catch, though: you must develop your application with data distribution and scaling in mind. This is proving to be a challenge for @dutch.
If you get a suitable app but can't set up a BOINC server, the Gridcoin community and project admins can help you. Or you can be like the Yafu project and run the server from a laptop.
I wish you good luck.

Great, I was just talking about it to Alex while watching a great presentation:

We need an n-gram model to improve the YouTube output so that it accounts for unknown words such as names, for example BitShares and DEX. An appropriate language model is the best solution to the current transcription problem because of the different and highly specific terms we use in our crypto space.
KenLM: kheafield.com/code/kenlm
What you do is create an LM for countering weird stuff. They call it the Tchaikovsky problem.
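
To make the idea concrete, here is a toy sketch of how an n-gram LM can pick between candidate transcriptions. It is a plain-Python bigram model with add-one smoothing, not KenLM's API (KenLM uses modified Kneser-Ney smoothing and is far more efficient). The corpus, the candidate sentences and the function names are all made up for illustration.

```python
import math
from collections import Counter


def train_bigram(sentences):
    """Count unigrams and bigrams over whitespace-tokenized sentences."""
    unigrams, bigrams = Counter(), Counter()
    for s in sentences:
        tokens = ["<s>"] + s.lower().split() + ["</s>"]
        unigrams.update(tokens)
        bigrams.update(zip(tokens, tokens[1:]))
    return unigrams, bigrams


def log_prob(sentence, unigrams, bigrams):
    """Log-probability of a sentence under an add-one smoothed bigram model."""
    vocab = len(unigrams) + 1  # +1 leaves room for unseen words
    tokens = ["<s>"] + sentence.lower().split() + ["</s>"]
    return sum(
        math.log((bigrams[(a, b)] + 1) / (unigrams[a] + vocab))
        for a, b in zip(tokens, tokens[1:])
    )


# Hypothetical in-domain text, standing in for forum posts and talks.
corpus = [
    "bitshares has a built in dex",
    "the dex on bitshares is decentralized",
    "trade steem on the bitshares dex",
]
unigrams, bigrams = train_bigram(corpus)

# Two hypothetical outputs from a speech recognizer for the same audio.
candidates = [
    "trade on the bitshares dex",
    "trade on the bit shares decks",
]
best = max(candidates, key=lambda c: log_prob(c, unigrams, bigrams))
print(best)
```

Because the model has seen "bitshares" and "dex" in the domain text, it scores the first candidate higher than the generic-English mishearing.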

Training time on a TitanX ~= 30 days. We can feed it all the Mumble talks from the past years; it needs about 10,000 hours of training. If we feed it a good language model by leveraging steem, bitsharestalk and bitcointalk, then we'll have a better engine than YouTube. We can use the Gridcoin network for this and have it in less than a week.

You should contact the admin of Citizen Science Grid. They are experimenting with neural learning on BOINC. You will soon run into a problem: if it takes 30 days on a TitanX (NVIDIA), are you able to split the task across hundreds of computers?
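
A rough sketch of what that splitting question means in practice. It assumes the job is embarrassingly parallel, i.e. each work unit can be processed without talking to the others, which training a single neural network generally is not. The file names, the unit size and the efficiency factor are made-up illustration values.

```python
def make_work_units(items, unit_size):
    """Split a list of inputs into fixed-size work units (the last may be smaller)."""
    return [items[i:i + unit_size] for i in range(0, len(items), unit_size)]


def estimated_days(single_machine_days, machines, efficiency=1.0):
    """Idealized wall-clock estimate; efficiency < 1 models overhead and slower hosts."""
    return single_machine_days / (machines * efficiency)


# Hypothetical recordings to be distributed to volunteers.
recordings = [f"talk_{i:04d}.ogg" for i in range(10)]
units = make_work_units(recordings, unit_size=4)

for n, unit in enumerate(units):
    print(f"WU {n}: {unit}")

# 30 single-GPU days over 100 hosts at an assumed 50% efficiency.
print(estimated_days(30, machines=100, efficiency=0.5))
```

The "less than a week" estimate only holds if the work really divides like this; if hosts have to exchange results between steps, the efficiency factor drops quickly.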

Thanks, I have no idea, well there are also the #gridcoin people I could ask...

that could possibly be a solution, if you really wanted to give it a try ... https://boinc.berkeley.edu/trac/wiki/PythonApps

Thanks Alex, but BOINC is a volunteer project, so you mean to add a new project? I would have thought people already had a speech-to-text machine learning task running on BOINC, but I don't see it in the list of projects...

I was just saying it should be possible. I believe though (if I recall correctly) new projects would have to be proposed to the community, and you have to get people interested in dedicating time to your project.

However, this may be a better question to direct to @tomasbrod, since he originally suggested boinc/gridcoin at the top of this thread!
