Gentoo Archives: gentoo-soc

From: Zhang Zongyu <zzy2529420793@×××××.com>
To: gentoo-soc@l.g.o
Subject: Re: [gentoo-soc] Weekly Report: Big Data Infrastructure and Maven Overlay in Week 5
Date: Sun, 05 Jul 2020 17:23:28
Message-Id: CAMBJAApTuRgiEKoev69A03=XugO=z2VivgNxus+txBwaZdZbKg@mail.gmail.com
In Reply to: Re: [gentoo-soc] Weekly Report: Big Data Infrastructure and Maven Overlay in Week 5 by EBo
1 Hello,
2
3 Here is the proposal of this project.[1]
4 In a nutshell, the goal of this project is to intergrate Portage with
5 Maven, or to translate Maven pom.xml into ebuild files.
6 And the most related project should be java-ebuilder, which aims at
7 processing pom.xml and forming an equivalent ebuild.[2]
8 The deliverable of this project will be an overlay resolving all the
9 dependencies of spark, which is widely used for big data processing, so my
10 mentor named the project "big data infrastructure".
11
12 Actually it is not related to big data processing or any "big data".
13
14 I am sorry if it confused you, and I am glad that you are interested in
15 this project.
16
17 Regards,
18 Zhang Zongyu
19
20 [1] https://summerofcode.withgoogle.com/projects/#4994566568017920
21 [2] https://github.com/6-6-6/java-ebuilder
22
23 EBo <ebo@×××××××.com> 于 2020年7月6日周一 上午12:18写道:
24
25 > Out of curiosity, can you point me to any project write up that was
26 > probably part of the proposal? One of my last big-data projects started
27 > out with a 3.5 petabyte repository and filtered on down to cover 10
28 > million contiguous kilometers. We did not use Maven, and I would be
29 > curious to see where the project is going. Now that I know about this I
30 > will try to follow it a bit more.
31 >
32 > Best of success!
33 >
34 > On Jul 5 2020 10:05 AM, Zhang Zongyu wrote:
35 > > Hello,
36 > >
37 > > This week I have done a lot of repetitive work. That is, to go
38 > > through
39 > > all of the ebuild files in Gentoo Repository and to get them the
40 > > equivalent GroupId and ArtifactId in Maven Central. There are more
41 > > than 500 packages in dev-java/ and java-virtual/ category, and
42 > > sometimes one Gentoo package may correspond to more than one Maven
43 > > artifact, so it took me a lot of time to complete them. And here is
44 > > my
45 > > lookup up table which covers all packages in dev-java/ and maps
46 > > "category/package:slot" into "groupId:artifactId".[1] As a result,
47 > > spark-core can make use of more than 100 Gentoo native packages now.
48 > >
49 > > Besides, I have also added java-ebuilder some features. It can check
50 > > the metadata of a package and get the correct SLOT, and it can make
51 > > ebuild unpack "Maven Central distributed source files" into proper
52 > > ${JAVA_SRC_DIR} now.
53 > >
54 > > Next week I am going to test and add jaxb (javax.xml.bind:jaxb-api
55 > > and
56 > > javax.xml.bind:jaxb-impl) and some newer version packages which are
57 > > required by spark-core, and I hope the newer version ebuild files
58 > > could be accepted by Gentoo-Java team. The prototype of the ebuild
59 > > files that are just generated by java-ebuilder are listed in [2].
60 > >
61 > > Regards,
62 > > Zhang Zongyu
63 > >
64 > > [1] lookup table containing all packages from dev-java/
65 > >
66 > >
67 > https://github.com/6-6-6/JEbuilder/blob/master/app-portage/java-ebuilder/files/scripts/LUT/LUT
68 > > [2] a prototype overlay
69 > > https://github.com/6-6-6/spark-overlay/tree/master/dev-java
70 >
71 >
72 >

Replies